From Relation Algebra to Semi-join Algebra: An Approach to Graph Query Optimization

Author:

Hellings Jelle1,Pilachowski Catherine L2,Van Gucht Dirk2,Gyssens Marc3,Wu Yuqing4

Affiliation:

1. Exploratory Systems Lab, Department of Computer Science, University of California, Davis, CA 95616-8562, USA, and Hasselt University, Faculty of Sciences, Martelarenlaan 42, 3500 Hasselt, Belgium

2. Indiana University, Luddy School of Informatics, Computing and Engineering, 919 E 10th Street, Bloomington, IN 47408, USA

3. Hasselt University, Faculty of Sciences, Data Science Institute, Martelarenlaan 42, 3500 Hasselt, Belgium

4. Pomona College, 185 E 6th Street, Claremont, CA 91711, USA

Abstract

Abstract Many graph query languages rely on composition to navigate graphs and select nodes of interest, even though evaluating compositions of relations can be costly. Often, this need for composition can be reduced by rewriting toward queries using semi-joins instead, resulting in a significant reduction of the query evaluation cost. We study techniques to recognize and apply such rewritings. Concretely, we study the relationship between the expressive power of the relation algebras, which heavily rely on composition, and the semi-join algebras, which replace composition in favor of semi-joins. Our main result is that each fragment of the relation algebras where intersection and/or difference is only used on edges (and not on complex compositions) is expressively equivalent to a fragment of the semi-join algebras. This expressive equivalence holds for node queries evaluating to sets of nodes. For practical relevance, we exhibit constructive rules for rewriting relation algebra queries to semi-join algebra queries and prove that they lead to only a well-bounded increase in the number of steps needed to evaluate the rewritten queries. In addition, on sibling-ordered trees, we establish new relationships among the expressive power of Regular XPath, Conditional XPath, FO-logic and the semi-join algebra augmented with restricted fixpoint operators.

Funder

National Science Foundation

Publisher

Oxford University Press (OUP)

Subject

General Computer Science

Reference63 articles.

1. Foundations of modern query languages for graph databases;Angles;ACM Comput. Surv.,2017

2. SPARQL 1.1 Query Language;Harris,2013

3. The Neo4j Cypher Manual v3.5;Neo4j Team,2019

4. Apache TinkerPop Documentation v3.4.4;Apache TinkerPop,2019

5. On the calculus of relations;Tarski;J. Symb. Log.,1941

Cited by 3 articles. 订阅此论文施引文献 订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献

1. Expressive Completeness of Two-Variable First-Order Logic with Counting for First-Order Logic Queries on Rooted Unranked Trees;2023 38th Annual ACM/IEEE Symposium on Logic in Computer Science (LICS);2023-06-26

2. An Algebra Supporting Semantic Query for Graph Data Model;2022 2nd International Conference on Consumer Electronics and Computer Engineering (ICCECE);2022-01-14

3. Optimizing Multiset Relational Algebra Queries Using Weak-Equivalent Rewrite Rules;Lecture Notes in Computer Science;2022

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3