Robust Cardinality: a novel approach for cardinality prediction in SQL queries

Published:2021-09-01 Issue:1 Volume:27 Page:
ISSN:0104-6500
Container-title:Journal of the Brazilian Computer Society
language:en
Short-container-title:J Braz Comput Soc

Author:

B. S. Praciano Francisco D., Amora Paulo R. P., Abreu Italo C., Pereira Francisco L. F., Machado Javam C.

Abstract

Background: Database Management Systems (DBMSs) use a declarative language to execute queries over stored data. The DBMS defines how data will be processed and ultimately retrieved. Therefore, it must choose the best option among the different possibilities based on an estimation process. The optimization process uses estimated cardinalities to make optimization decisions, such as choosing predicate order.

Methods: In this paper, we propose Robust Cardinality, an approach to calculate cardinality estimates of query operations to guide the execution engine of the DBMS to choose the best possible plan, or at least avoid the worst one. By using machine learning instead of the current histogram heuristics, it is possible to improve these estimates, leading to more efficient query execution.

Results: We perform experimental tests using PostgreSQL, comparing both estimators and a modern technique proposed in the literature. With Robust Cardinality, a lower estimation error was obtained for a batch of queries, and PostgreSQL executed these queries more efficiently than when using the default estimator. We observed a 3% reduction in execution time after reducing the query estimation error by a factor of 4.

Conclusions: From the results, it is possible to conclude that this new approach improves query processing in DBMSs, especially in the generation of cardinality estimates.
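To make the notion of "estimation error" concrete, here is a minimal Python sketch. It is not the paper's method: the abstract does not say which error metric or which model was used. The sketch assumes the q-error (the factor by which an estimate is off, a metric commonly used in the cardinality estimation literature) and a textbook independence-based heuristic; the function names and all numbers are hypothetical.

```python
from typing import Sequence


def q_error(estimated: float, actual: float) -> float:
    """Factor by which an estimate is off, in either direction (always >= 1)."""
    # Clamp both values to at least 1 row so empty results do not divide by zero.
    est = max(float(estimated), 1.0)
    act = max(float(actual), 1.0)
    return max(est / act, act / est)


def independence_estimate(table_rows: int, selectivities: Sequence[float]) -> float:
    """Textbook heuristic: multiply per-predicate selectivities as if independent."""
    estimate = float(table_rows)
    for selectivity in selectivities:
        estimate *= selectivity
    return estimate


# Hypothetical example: two predicates on a 1000-row table.
heuristic_rows = independence_estimate(1000, [0.5, 0.25])  # 125.0 rows
actual_rows = 250  # assumed true result size when the predicates are correlated

print(q_error(heuristic_rows, actual_rows))
```

A q-error of 1 means a perfect estimate. Because the metric is symmetric, an underestimate and an overestimate by the same factor score the same, which is one reason it is a convenient way to compare estimators.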

Funder

Coordenação de Aperfeiçoamento de Pessoal de Nível Superior

Publisher

Springer Science and Business Media LLC

Subject

General Computer Science

Link

https://link.springer.com/content/pdf/10.1186/s13173-021-00115-9.pdf

References: 43 articles.

1. Kooi RP (1980) The optimization of queries in relational databases. PhD thesis. Case Western Reserve University, Cleveland, OH, USA. AAI8109596.

2. Leis V, Gubichev A, Mirchev A, Boncz PA, Kemper A, Neumann T (2015) How good are query optimizers, really? Proc VLDB Endowment 9(3):204–215.

3. Leis V, Radke B, Gubichev A, Mirchev A, Boncz PA, Kemper A, Neumann T (2018) Query optimization through the looking glass, and what we found running the join order benchmark. VLDB J 27(5):643–668.

4. Ioannidis YE, Christodoulakis S (1991) On the propagation of errors in the size of join results. In: Proceedings of the 1991 ACM SIGMOD International Conference on Management of Data, 268–277. https://doi.org/10.1145/115790.115835.

5. Moerkotte G, Neumann T, Steidl G (2009) Preventing bad plans by bounding the impact of cardinality estimation errors. Proc VLDB Endowment 2(1):982–993.