Construction of Knowledge Graphs: Current State and Challenges
-
Published:2024-08-22
Issue:8
Volume:15
Page:509
-
ISSN:2078-2489
-
Container-title:Information
-
language:en
-
Short-container-title:Information
Author:
Hofer Marvin1ORCID, Obraczka Daniel1ORCID, Saeedi Alieh2ORCID, Köpcke Hanna13ORCID, Rahm Erhard12ORCID
Affiliation:
1. Center for Scalable Data Analytics and Artificial Intelligence (ScaDS.AI) Dresden/Leipzig, 04105 Leipzig, Germany 2. Department of Computer Science, Leipzig University, 04109 Leipzig, Germany 3. Faculty Applied Computer Sciences & Biosciences, University of Applied Sciences Mittweida, 09648 Mittweida, Germany
Abstract
With Knowledge Graphs (KGs) at the center of numerous applications such as recommender systems and question-answering, the need for generalized pipelines to construct and continuously update such KGs is increasing. While the individual steps that are necessary to create KGs from unstructured sources (e.g., text) and structured data sources (e.g., databases) are mostly well researched for their one-shot execution, their adoption for incremental KG updates and the interplay of the individual steps have hardly been investigated in a systematic manner so far. In this work, we first discuss the main graph models for KGs and introduce the major requirements for future KG construction pipelines. Next, we provide an overview of the necessary steps to build high-quality KGs, including cross-cutting topics such as metadata management, ontology development, and quality assurance. We then evaluate the state of the art of KG construction with respect to the introduced requirements for specific popular KGs, as well as some recent tools and strategies for KG construction. Finally, we identify areas in need of further research and improvement.
Funder
Federal Ministry of Education and Research of Germany Sächsische Staatsministerium für Wissenschaft Kultur und Tourismus in the program Center of Excellence for AI-research
Reference394 articles.
1. Huang, X., Zhang, J., Li, D., and Li, P. (2019, January 11–15). Knowledge Graph Embedding Based Question Answering. Proceedings of the Twelfth ACM International Conference on Web Search and Data Mining, WSDM 2019, Melbourne, VIC, Australia. 2. Wang, X., He, X., Cao, Y., Liu, M., and Chua, T. (2019, January 4–8). KGAT: Knowledge Graph Attention Network for Recommendation. Proceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, KDD 2019, Anchorage, AK, USA. 3. Discovering protein drug targets using Knowledge Graph embeddings;Mohamed;Bioinformatics,2019 4. Oberkampf, H., Zillner, S., and Bauer, B. (2012, January 21–25). Interpreting Patient Data using Medical Background Knowledge. Proceedings of the 3rd International Conference on Biomedical Ontology (ICBO 2012), KR-MED Series, Graz, Austria. 5. The Clinical Data Intelligence Project—A smart data initiative;Sonntag;Inform. Spektrum,2016
|
|