Abstract
AbstractKnowledge graphs (KGs) are useful data structures for the integration, retrieval, dissemination, and inference of information in various information domains. One of the main challenges in building KGs is the extraction of named entities (nodes) and their relations (edges), particularly when processing unstructured text as it has no semantic descriptions. Generating KGs from texts written in Spanish represents a research challenge as the existing structures, models, and strategies designed for other languages are not compatible in this scenario. This paper proposes a method to design and construct KGs from unstructured text in Spanish. We defined lexical patterns to extract named entities and (non) taxonomic, equivalence, and composition relations. Next, named entities are linked and enriched with DBpedia resources through a strategy based on SPARQL queries. Finally, OWL properties are defined from the predicate relations for creating resource description framework (RDF) triples. We evaluated the performance of the proposed method to determine the degree of elements extracted from the input text and to assess their quality through standard information retrieval measures. The evaluation revealed the feasibility of the proposed method to extract RDF triples from datasets in general and computer science domains. Competitive results were observed by comparing our method regarding an existing approach from the literature.
Funder
secretaría de educación pública
Publisher
Springer Science and Business Media LLC
Subject
Computational Mathematics,Engineering (miscellaneous),Information Systems,Artificial Intelligence
Reference45 articles.
1. Al-Moslmi T, Ocaña MG, Opdahl AL, Veres C (2020) Named entity extraction for knowledge graphs: a literature overview. IEEE Access 8:32862–32881. https://doi.org/10.1109/ACCESS.2020.2973928
2. Bowker L (2003) Lexical knowledge patterns, semantic relations, and language varieties: exploring the possibilities for refining information retrieval in an international context. Catalog Classif Quart 37(1–2):153–171
3. Buscaldi D, Dessı D, Motta E, Osborne F, Recupero DR (2019) Mining scholarly data for fine-grained knowledge graph construction. In: Proceedings of the workshop on deep learning for knowledge graphs (DL4KG2019) at ESWC2019
4. Carriero VA, Gangemi A, Mancinelli ML, Marinucci L, Nuzzolese AG, Presutti V, Veninata C (2019) Arco: The Italian cultural heritage knowledge graph. In: The Semantic Web—ISWC 2019—18th International Semantic Web Conference, Auckland, New Zealand, October 26-30, 2019, Proceedings, Part II, Springer, Lecture Notes in Computer Science, vol 11779, pp 36–52. https://doi.org/10.1007/978-3-030-30796-7_3
5. Chen P, Lu Y, Zheng VW, Chen X, Yang B (2018) Knowedu: A system to construct knowledge graph for education. IEEE Access 6:31553–31563. https://doi.org/10.1109/ACCESS.2018.2839607
Cited by
1 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献