ORKG-Leaderboards: a systematic workflow for mining leaderboards as a knowledge graph-Reference-Cited by-同舟云学术

ORKG-Leaderboards: a systematic workflow for mining leaderboards as a knowledge graph

Published:2023-06-15 Issue: Volume: Page:
ISSN:1432-5012
Container-title:International Journal on Digital Libraries
language:en
Short-container-title:Int J Digit Libr

Author:

Kabongo Salomon^ORCID,D’Souza Jennifer^ORCID,Auer Sören^ORCID

Abstract

AbstractThe purpose of this work is to describe the orkg-Leaderboard software designed to extract leaderboards defined as task–dataset–metric tuples automatically from large collections of empirical research papers in artificial intelligence (AI). The software can support both the main workflows of scholarly publishing, viz. as LaTeX files or as PDF files. Furthermore, the system is integrated with the open research knowledge graph (ORKG) platform, which fosters the machine-actionable publishing of scholarly findings. Thus, the systemsss output, when integrated within the ORKG’s supported Semantic Web infrastructure of representing machine-actionable ‘resources’ on the Web, enables: (1) broadly, the integration of empirical results of researchers across the world, thus enabling transparency in empirical research with the potential to also being complete contingent on the underlying data source(s) of publications; and (2) specifically, enables researchers to track the progress in AI with an overview of the state-of-the-art across the most common AI tasks and their corresponding datasets via dynamic ORKG frontend views leveraging tables and visualization charts over the machine-actionable data. Our best model achieves performances above 90% F1 on the leaderboard extraction task, thus proving orkg-Leaderboards a practically viable tool for real-world usage. Going forward, in a sense, orkg-Leaderboards transforms the leaderboard extraction task to an automated digitalization task, which has been, for a long time in the community, a crowdsourced endeavor.

Funder

Bundesministerium für Bildung und Forschung

FP7 Ideas: European Research Council

Publisher

Springer Science and Business Media LLC

Subject

Library and Information Sciences

Link

https://link.springer.com/content/pdf/10.1007/s00799-023-00366-1.pdf

Reference44 articles.

1. Parra Escartín, C., Reijers, W., Lynn, T., Moorkens, J., Way, A., Liu, C.-H.: Ethical considerations in NLP shared tasks. In: Proceedings of the First ACL Workshop on Ethics in Natural Language Processing, pp. 66–73. Association for Computational Linguistics, Valencia, Spain (2017). https://doi.org/10.18653/v1/W17-1608

2. Nissim, M., Abzianidze, L., Evang, K., van der Goot, R., Haagsma, H., Plank, B., Wieling, M.: Last words: sharing is caring: the future of shared tasks. Comput. Linguist. 43(4), 897–904 (2017)

3. Kim, J.-D., Pyysalo, S.: In: Dubitzky, W., Wolkenhauer, O., Cho, K.-H., Yokota, H. (eds.) BioNLP Shared Task, pp. 138–141. Springer, New York (2013). https://doi.org/10.1007/978-1-4419-9863-7_138

4. Jinha, A.E.: Article 50 million: an estimate of the number of scholarly articles in existence. Learn. Publ. 23(3), 258–263 (2010)

5. Chiarelli, A., Johnson, R., Richens, E., Pinfield, S.: Accelerating scholarly communication: the transformative role of preprints (2019)

Cited by 2 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Correction: ORKG-Leaderboards: a systematic workflow for mining leaderboards as a knowledge graph;International Journal on Digital Libraries;2024-05-28

2. CLEF 2024 SimpleText Track;Lecture Notes in Computer Science;2024