Size and Treewidth Bounds for Conjunctive Queries


Georg Gottlob (1), Stephanie Tien Lee (1), Gregory Valiant (2), Paul Valiant (2)


1. University of Oxford, UK

2. University of California, Berkeley


This article provides new worst-case bounds for the size and treewidth of the result Q(D) of a conjunctive query Q applied to a database D. We derive bounds for the result size |Q(D)| in terms of structural properties of Q, both in the absence and in the presence of keys and functional dependencies. These bounds are based on a novel “coloring” of the query variables that associates a coloring number C(Q) to each query Q. Intuitively, each color used represents some possible entropy of that variable. Using this coloring number, we derive tight bounds for the size of Q(D) in two cases: (i) no functional dependencies or keys are specified, and (ii) simple functional dependencies (keys) are given. These results generalize recent size bounds for join queries obtained by Atserias et al. [2008]. In the case of arbitrary (compound) functional dependencies, we use tools from information theory to provide lower and upper bounds, establishing a close connection between size bounds and a basic question in information theory. Our new coloring scheme also allows us to precisely characterize (both in the absence of keys and with simple keys) the treewidth-preserving queries, that is, the queries for which the treewidth of the output relation is bounded by a function of the treewidth of the input database. Finally, we give some results on the computational complexity of determining the size bounds, and of deciding whether the treewidth is preserved.
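As a small illustration of the kind of worst-case size bound discussed above, the following Python sketch checks the well-known bound of Atserias et al. for the triangle join Q(a, b, c) :- R(a, b), S(b, c), T(a, c): if each relation has at most N tuples, the result has at most N^(3/2) tuples. The abstract does not define the coloring number C(Q), so this sketch does not compute it; the function name `triangle_join` and the test instance are assumptions chosen for illustration only.

```python
from itertools import product


def triangle_join(R, S, T):
    """Evaluate Q(a, b, c) :- R(a, b), S(b, c), T(a, c).

    R, S, T are sets of pairs. This is a plain hash-join sketch,
    not a worst-case optimal join algorithm.
    """
    # Index S by its first attribute so each R tuple finds its matches quickly.
    s_by_b = {}
    for b, c in S:
        s_by_b.setdefault(b, []).append(c)
    t_set = set(T)
    return {
        (a, b, c)
        for a, b in R
        for c in s_by_b.get(b, [])
        if (a, c) in t_set
    }


# Hypothetical instance: every relation holds all pairs over a domain of size m.
m = 4
pairs = set(product(range(m), repeat=2))
R = S = T = pairs
n = len(pairs)  # size of each input relation, here m * m

result = triangle_join(R, S, T)

# Compare |Q(D)| with n^(3/2) using integers only: |Q(D)|^2 <= n^3.
within_bound = len(result) ** 2 <= n ** 3
print(len(result), n, within_bound)
```

On this dense instance the result has m^3 tuples and each relation has m^2 tuples, so the bound is met with equality, which is why such bounds are called tight.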


Engineering and Physical Sciences Research Council


Association for Computing Machinery (ACM)


Artificial Intelligence, Hardware and Architecture, Information Systems, Control and Systems Engineering, Software

References: 37 articles.

1. Abiteboul, S., Hull, R., and Vianu, V. 1995. Foundations of Databases. Addison-Wesley.

2. The theory of joins in relational databases

3. Equivalences among Relational Expressions

4. Easy problems for tree-decomposable graphs

5. Size Bounds and Query Plans for Relational Joins

Cited by 28 articles.

1. ADOPT: Adaptively Optimizing Attribute Orders for Worst-Case Optimal Join Algorithms via Reinforcement Learning;Proceedings of the VLDB Endowment;2023-07

2. Applications of Information Inequalities to Database Theory Problems;2023 38th Annual ACM/IEEE Symposium on Logic in Computer Science (LICS);2023-06-26

3. SafeBound: A Practical System for Generating Cardinality Bounds;Proceedings of the ACM on Management of Data;2023-05-26

4. A Nearly Instance-optimal Differentially Private Mechanism for Conjunctive Queries;Proceedings of the 41st ACM SIGMOD-SIGACT-SIGAI Symposium on Principles of Database Systems;2022-06-12

5. Query Evaluation by Circuits;Proceedings of the 41st ACM SIGMOD-SIGACT-SIGAI Symposium on Principles of Database Systems;2022-06-12






