Practical selectivity estimation through adaptive sampling-Reference-Cited by-同舟云学术

Practical selectivity estimation through adaptive sampling

Published:1990-05 Issue:2 Volume:19 Page:1-11
ISSN:0163-5808
Container-title:ACM SIGMOD Record
language:en
Short-container-title:SIGMOD Rec.

Author:

Lipton Richard J.¹,Naughton Jeffrey F.²,Schneider Donovan A.²

Affiliation:

1. Department of Computer Science, Princeton University

2. Department of Computer Sciences, University of Wisconsin

Abstract

Recently we have proposed an adaptive, random sampling algorithm for general query size estimation. In earlier work we analyzed the asymptotic efficiency and accuracy of the algorithm, in this paper we investigate its practicality as applied to selects and joins. First, we extend our previous analysis to provide significantly improved bounds on the amount of sampling necessary for a given level of accuracy. Next, we provide “sanity bounds” to deal with queries for which the underlying data is extremely skewed or the query result is very small. Finally, we report on the performance of the estimation algorithm as implemented in a host language on a commercial relational system. The results are encouraging, even with this loose coupling between the estimation algorithm and the DBMS.

Publisher

Association for Computing Machinery (ACM)

Subject

Information Systems,Software

Link

https://dl.acm.org/doi/pdf/10.1145/93605.93611

Reference23 articles.

1. D B~tton D DeWltt and C Turbyfill Benchmarkmg database systems A systematic approach In Proc Nznth VLDB pages 8-19 1983 D B~tton D DeWltt and C Turbyfill Benchmarkmg database systems A systematic approach In Proc Nznth VLDB pages 8-19 1983

2. S Chnstodoulakls Estimating block transfers and join sizes In Proc A CM SIGMOD Conference pages 40-54 May 1983 10.1145/582192.582204 S Chnstodoulakls Estimating block transfers and join sizes In Proc A CM SIGMOD Conference pages 40-54 May 1983 10.1145/582192.582204

3. R Demolombe Estimation of the number of tuples satxsfymg a query expressed in predicate calculus language In Proc Szzth VLDB pages 55-63 1980 R Demolombe Estimation of the number of tuples satxsfymg a query expressed in predicate calculus language In Proc Szzth VLDB pages 55-63 1980

4. J Fedorowlcz Database performance evaluation using multiple regresmon techniques In Proc A CM SIGMOD Conference pages 70-76 June 1984 10.1145/602259.602269 J Fedorowlcz Database performance evaluation using multiple regresmon techniques In Proc A CM SIGMOD Conference pages 70-76 June 1984 10.1145/602259.602269

Cited by 74 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. ReJOOSp: Reinforcement Learning for Join Order Optimization in SPARQL;Big Data and Cognitive Computing;2024-06-27

2. Automating localized learning for cardinality estimation based on XGBoost;Knowledge and Information Systems;2024-06-01

3. Duet: Efficient and Scalable Hybrid Neural Relation Understanding;2024 IEEE 40th International Conference on Data Engineering (ICDE);2024-05-13

4. RelJoin: Relative-cost-based selection of distributed join methods for query plan optimization;Information Sciences;2024-02

5. Selectivity Estimation for Queries Containing Predicates over Set-Valued Attributes;Proceedings of the ACM on Management of Data;2023-12-08