Affiliation:
1. TIB Leibniz Information Centre for Science and Technology, 30167 Hannover, Germany
Abstract
We introduce the Open Research Knowledge Graph Agriculture Named Entity Recognition (the ORKG Agri-NER) corpus and service for contribution-centric scientific entity extraction and classification. The ORKG Agri-NER corpus is a seminal benchmark for the evaluation of contribution-centric scientific entity extraction and classification in the agricultural domain. It comprises titles of scholarly papers that are available as Open Access articles on a major publishing platform. We describe the creation of this corpus and highlight the obtained findings in terms of the following features: (1) a generic conceptual formalism focused on capturing scientific entities in agriculture that reflect the direct contribution of a work; (2) a performance benchmark for named entity recognition of scientific entities in the agricultural domain by empirically evaluating various state-of-the-art sequence labeling neural architectures and transformer models; and (3) a delineated 3-step automatic entity resolution procedure for the resolution of the scientific entities to an authoritative ontology, specifically AGROVOC that is released in the Linked Open Vocabularies cloud. With this work we aim to provide a strong foundation for future work on the automatic discovery of scientific entities in the scholarly literature of the agricultural domain.
Funder
Federal Ministry of Education and Research
EU H2020 ERC project
Reference80 articles.
1. Johnson, R., Watkinson, A., and Mabe, M. (2018). The STM Report: An Overview of Scientific and Scholarly Publishing, International Association of Scientific, Technical and Medical Publishers.
2. Strategic reading, ontologies, and the future of scientific publishing;Renear;Science,2009
3. The FAIR Guiding Principles for scientific data management and stewardship;Wilkinson;Sci. Data,2016
4. Ammar, W., Groeneveld, D., Bhagavatula, C., Beltagy, I., Crawford, M., Downey, D., Dunkelberger, J., Elgohary, A., Feldman, S., and Ha, V. (2018, January 1–6). Construction of the Literature Graph in Semantic Scholar. Proceedings of the NAACL-HLT, New Orleans, LA, USA.
5. Improving access to scientific literature with knowledge graphs;Auer;Bibl. Forsch. Prax.,2020