Hashing the Hypertrie: Space- and Time-Efficient Indexing for SPARQL in Tensors

Published:2022 Page:57-73
ISSN:0302-9743
Container-title:The Semantic Web – ISWC 2022

Author:

Bigerl Alexander, Conrads Lixi, Behning Charlotte, Saleem Muhammad, Ngonga Ngomo Axel-Cyrille

Abstract

Time-efficient solutions for querying RDF knowledge graphs depend on indexing structures with low response times to answer SPARQL queries rapidly. Hypertries, an indexing structure we recently developed for tensor-based triple stores, have achieved significant runtime improvements over several mainstream storage solutions for RDF knowledge graphs. However, the space footprint of this novel data structure is still often larger than that of many mainstream solutions. In this work, we detail means to reduce the memory footprint of hypertries and thereby further speed up query processing in hypertrie-based RDF storage solutions. Our approach relies on three strategies: (1) the elimination of duplicate nodes via hashing, (2) the compression of non-branching paths, and (3) the storage of single-entry leaf nodes in their parent nodes. We evaluate these strategies by comparing them with baseline hypertries as well as popular triple stores such as Virtuoso, Fuseki, GraphDB, Blazegraph and gStore. We rely on four datasets/benchmark generators in our evaluation: SWDF, DBpedia, WatDiv, and WikiData. Our results suggest that our modifications significantly reduce the memory footprint of hypertries by up to 70% while leading to a relative improvement of up to 39% with respect to average Queries per Second and up to 740% with respect to Query Mixes per Hour.
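
Strategy (1) can be illustrated with a minimal sketch. It is not the authors' implementation: it uses a plain prefix trie rather than a hypertrie, and the names `NodeStore`, `intern`, `build`, and `LEAF` are hypothetical. The idea shown is only that nodes with identical content are identified through a hash lookup and stored once.

```python
from typing import Dict, Iterable, List, Tuple

LEAF = -1  # hypothetical marker for "end of tuple"; not taken from the paper

# A node's content: sorted (key part, child id) pairs, hashable by construction.
NodeContent = Tuple[Tuple[int, int], ...]


class NodeStore:
    """Stores each distinct node content exactly once (hash-based deduplication)."""

    def __init__(self) -> None:
        self._ids: Dict[NodeContent, int] = {}  # content -> node id
        self.nodes: List[NodeContent] = []      # node id -> content

    def intern(self, children: Dict[int, int]) -> int:
        """Return the id of the node with these children, creating it if new."""
        content = tuple(sorted(children.items()))
        node_id = self._ids.get(content)
        if node_id is None:
            node_id = len(self.nodes)
            self._ids[content] = node_id
            self.nodes.append(content)
        return node_id


def build(store: NodeStore, tuples: Iterable[Tuple[int, ...]]) -> int:
    """Build a plain trie over equal-length tuples bottom-up, interning every node."""
    tuples = list(tuples)
    if not tuples or len(tuples[0]) == 0:
        return LEAF
    groups: Dict[int, List[Tuple[int, ...]]] = {}
    for t in tuples:
        groups.setdefault(t[0], []).append(t[1:])
    return store.intern({key: build(store, rest) for key, rest in groups.items()})


store = NodeStore()
root = build(store, [(1, 2, 3), (4, 2, 3)])
print(len(store.nodes))  # the two identical sub-tries under keys 1 and 4 are shared
```

In this toy input, the sub-tries below the first key parts 1 and 4 are identical, so a trie without deduplication would hold five nodes while the store above holds three.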

Publisher

Springer International Publishing

Link

https://link.springer.com/content/pdf/10.1007/978-3-031-19433-7_4

Cited by 4 articles.

1. Efficient Evaluation of Conjunctive Regular Path Queries Using Multi-way Joins;Lecture Notes in Computer Science;2024

2. Evaluating Negation with Multi-way Joins Accelerates Class Expression Learning;Lecture Notes in Computer Science;2024

3. Native Execution of GraphQL Queries over RDF Graphs Using Multi-Way Joins;Knowledge Graphs: Semantics, Machine Learning, and Languages;2023-09-11

4. Chapter 13. Class Expression Learning with Multiple Representations;Frontiers in Artificial Intelligence and Applications;2023-07-21