Cardinality Estimation over Knowledge Graphs with Embeddings and Graph Neural Networks

Author:

Schwabe Tim1ORCID,Acosta Maribel2ORCID

Affiliation:

1. TUM School of Computation, Information and Technology, Technical University of Munich & Ruhr University Bochum, Munich, Germany

2. TUM School of Computation, Information and Technology, Technical University of Munich, Munich, Germany

Abstract

Cardinality Estimation over Knowledge Graphs (KG) is crucial for query optimization, yet remains a challenging task due to the semi-structured nature and complex correlations of data in typical KGs. In this work, we propose GNCE, a novel approach that leverages knowledge graph embeddings and Graph Neural Networks (GNN) to accurately predict the cardinality of conjunctive queries over KGs. GNCE first creates semantically meaningful embeddings for all entities in the KG, which are then used to learn a representation of a query using a GNN to estimate the cardinality of the query. We evaluate GNCE on several KGs in terms of q-Error and demonstrate that it outperforms state-of-the-art approaches based on sampling, summaries, and (machine) learning in terms of estimation accuracy while also having a low execution time and few parameters. Additionally, we show that GNCE performs similarly well on real-world queries and can inductively generalize to unseen entities, making it suitable for use in dynamic query processing scenarios. Our proposed approach has the potential to significantly improve query optimization and related applications that rely on accurate cardinality estimates of conjunctive queries.

Publisher

Association for Computing Machinery (ACM)

Reference60 articles.

1. An analytical study of large SPARQL query logs

2. Michael M. Bronstein, Joan Bruna, Taco Cohen, and 2021. Geometric Deep Learning: Grids, Groups, Graphs, Geodesics, and Gauges. (4 2021). http://arxiv.org/abs/2104.13478

3. Daniel Casals Carlos Buil-Aranda and Carlos Valle. [n. d.]. SPARQL query execution time prediction using Deep Learning. ( [n. d.]).

4. Measuring and Relieving the Over-Smoothing Problem for Graph Neural Networks from the Topological View

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3