Affiliation:
1. A.P. Ershov Institute of Informatics Systems, Siberian Branch, Russian Academy of Sciences
Abstract
This paper considers international and Russian-language data sources that provide information about Russian research-related organizations. Information about research organizations is an important attribute: it makes it possible to identify the authors of scientific publications, to analyze the geographical distribution of publications, and to assess how geographic factors affect the citation of publications. However, information about national research organizations, such as Russian research organizations, is often incomplete or distorted in international databases. The data sources considered are GRID, the Russian and English editions of Wikipedia, Wikidata, and eLIBRARY.ru. It is demonstrated that Russian-language data sources contain more information about Russian research-related organizations than most international data sources, but that this information is not available in English-language data sources. To solve this problem, a method for integrating information from multilingual data sources has been developed. Experiments on comparing and integrating information about Russian research organizations from international and Russian data sources are outlined. An experimental version of a database comprising 3143 scientific organizations has been created. The work is an intermediate step towards the creation of an open and extensible knowledge graph.
Publisher
Keldysh Institute of Applied Mathematics