Parallel pipelined filter ordering with precedence constraints-Reference-Cited by-同舟云学术

Parallel pipelined filter ordering with precedence constraints

Published:2012-09 Issue:4 Volume:8 Page:1-38
ISSN:1549-6325
Container-title:ACM Transactions on Algorithms
language:en
Short-container-title:ACM Trans. Algorithms

Author:

Deshpande Amol¹,Hellerstein Lisa²

Affiliation:

1. University of Maryland College Park, MD

2. Polytechnic Institute of NYU, Brooklyn, NY

Abstract

In the parallel pipelined filter ordering problem, we are given a set of n filters that run in parallel. The filters need to be applied to a stream of elements, to determine which elements pass all filters. Each filter has a rate limit r i on the number of elements it can process per unit time, and a selectivity p i , which is the probability that a random element will pass the filter. The goal is to maximize throughput. This problem appears naturally in a variety of settings, including parallel query optimization in databases and query processing over Web services. We present an O ( n 3 ) algorithm for this problem, given tree-structured precedence constraints on the filters. This extends work of Condon et al. [2009] and Kodialam [2001], who presented algorithms for solving the problem without precedence constraints. Our algorithm is combinatorial and produces a sparse solution. Motivated by join operators in database queries, we also give algorithms for versions of the problem in which “filter” selectivities may be greater than or equal to 1. We prove a strong connection between the more classical problem of minimizing total work in sequential filter ordering (A), and the parallel pipelined filter ordering problem (B). More precisely, we prove that A is solvable in polynomial time for a given class of precedence constraints if and only if B is as well. This equivalence allows us to show that B is NP-Hard in the presence of arbitrary precedence constraints (since A is known to be NP-Hard in that setting).

Funder

Division of Computing and Communication Foundations

National Science Foundation

Division of Information and Intelligent Systems

Publisher

Association for Computing Machinery (ACM)

Subject

Mathematics (miscellaneous)

Link

https://dl.acm.org/doi/pdf/10.1145/2344422.2344431

Reference26 articles.

1. Eddies

2. Adaptive ordering of pipelined stream filters

3. Feature Article—The Ellipsoid Method: A Survey

4. Join queries with external text sources

Cited by 8 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Top-k query optimization over data services;Future Generation Computer Systems;2020-12

2. Evaluation of Heuristics for Product Data Models;Business Process Management Workshops;2020

3. On Knowledge Transfer from Cost-Based Optimization of Data-Centric Workflows to Business Process Redesign;Transactions on Large-Scale Data- and Knowledge-Centered Systems XLIII;2020

4. Optimization of data flow execution in a parallel environment;Distributed and Parallel Databases;2018-08-22

5. The many faces of data-centric workflow optimization: a survey;International Journal of Data Science and Analytics;2018-03-06