Towards Building the Knowledge Graph for a Collection of Mathematical Articles
Author:
Gizatullin Bulat TimurovichORCID, Nevzoova Olga AvenirovnaORCID
Abstract
This paper describes the process of creating a knowledge graph for a collection of mathematical articles in the Russian language, gathered from the "Izvestiya VUZov. Matematika" journal. The collection consists of approximately 1100 documents in LaTex format. The work involves constructing an ontology for the collection of mathematical articles, which will serve as the basis for the created knowledge graph. Various article objects are extracted from the collection, including universal decimal classification codes, authors, titles, used formulas, articles publication dates, authors affiliations and references to other works. Each object is recorded through a specific relationship in the knowledge graph. Thematic modeling is also performed on the collection using the latent Dirichlet allocation method, for which optimal hyperparameters are selected. The document themes are recorded in the knowledge graph through relationships. An interesting approach is used for extracting mathematical terms. In this work, mathematical entities are identified in the documents using the OntoMathPRO ontology. During the knowledge graph construction process, tools were developed that allow the creation of a knowledge graph on any collection that meets the patterns of the original collection. The resulting knowledge graph can serve as a foundation for various research purposes and the development of intelligent systems, that can be used by researchers, journals, as well as students.
Publisher
Keldysh Institute of Applied Mathematics
Reference12 articles.
1. Hogan, A., Gutierrez, C., Cochez, M., et al.: Knowledge Graphs. Synthesis Lectures on Data, Semantics, and Knowledge, 237 p. Springer Cham (2022). 2. Lehmann, J., Isele, R., Jakob, M., et al. DBpedia – A large-scale, multilingual knowledge base extracted from Wikipedia. Semantic Web Journal, 6(2), 167–195 (2015). 3. Bollacker, K., Cook, R., Tufts, P.: Freebase: a shared database of structured general human knowledge. In: Proceedings of the 22nd National Conference on Artificial Intelligence, vol. 2, pp. 1962–1963 (2007). AAAI Press. 4. Vrandečić, D., and Krötzsch, M.: Wikidata: A free collaborative knowledge base. Communications of the ACM, 57(10), pp. 78–85 (2014). 5. Hoffart, J., Suchanek, F. M., Berberich, K., Lewis-Kelham, E., de Melo, G., and Weikum, G.: YAGO2: Exploring and querying world knowledge in time, space, context, and many languages. In: Srinivasan, S., Ramamritham, K., Kumar, A., et al. (eds.) Proc. of the 20th International Conference on World Wide Web, pp. 229–232, ACM Press, India, Hyderabad (2011).
|
|