Affiliation:
1. Ontology Engineering Group, Universidad Politécnica de Madrid, Spain
2. Declarative Languages and Artificial Intelligence Group, KU Leuven, Belgium
3. Flanders Make, DTAI-FET, Belgium
Abstract
Knowledge graphs are often constructed from heterogeneous data sources, using declarative rules that map them to a target ontology and materializing them into RDF. When these data sources are large, the materialization of the entire knowledge graph may be computationally expensive and not suitable for those cases where a rapid materialization is required. In this work, we propose an approach to overcome this limitation, based on the novel concept of mapping partitions. Mapping partitions are defined as groups of mapping rules that generate disjoint subsets of the knowledge graph. Each of these groups can be processed separately, reducing the total amount of memory and execution time required by the materialization process. We have included this optimization in our materialization engine Morph-KGC, and we have evaluated it over three different benchmarks. Our experimental results show that, compared with state-of-the-art techniques, the use of mapping partitions in Morph-KGC presents the following advantages: (i) it decreases significantly the time required for materialization, (ii) it reduces the maximum peak of memory used, and (iii) it scales to data sizes that other engines are not capable of processing currently.
Subject
Computer Networks and Communications,Computer Science Applications,Information Systems
Reference41 articles.
1. J. Arenas-Guerrero, M. Scrocca, A. Iglesias-Molina, J. Toledo, L. Pozo-Gilo, D. Doña, O. Corcho and D. Chaves-Fraga, Knowledge graph construction with R2RML and RML: An ETL system-based overview, in: Proceedings of the 2nd International Workshop on Knowledge Graph Construction, CEUR Workshop Proceedings, Vol. 2873, CEUR-WS.org, 2021.
2. The Berlin SPARQL benchmark;Bizer;International Journal on Semantic Web and Information Systems, IJSWIS,2009
3. Ontop: Answering SPARQL queries over relational databases;Calvanese;Semantic Web,2017
4. Logic-based approach to semantic query optimization;Chakravarthy;ACM Transactions on Database Systems,1990
5. What Are the Parameters that Affect the Construction of a Knowledge Graph?
Cited by
26 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献