Evaluating human versus machine learning performance in classifying research abstracts-Reference-Cited by-同舟云学术

Evaluating human versus machine learning performance in classifying research abstracts

Published:2020-07-18 Issue:2 Volume:125 Page:1197-1212
ISSN:0138-9130
Container-title:Scientometrics
language:en
Short-container-title:Scientometrics

Author:

Goh Yeow Chong,Cai Xin Qing,Theseira Walter,Ko Giovanni,Khor Khiam Aik

Abstract

AbstractWe study whether humans or machine learning (ML) classification models are better at classifying scientific research abstracts according to a fixed set of discipline groups. We recruit both undergraduate and postgraduate assistants for this task in separate stages, and compare their performance against the support vectors machine ML algorithm at classifying European Research Council Starting Grant project abstracts to their actual evaluation panels, which are organised by discipline groups. On average, ML is more accurate than human classifiers, across a variety of training and test datasets, and across evaluation panels. ML classifiers trained on different training sets are also more reliable than human classifiers, meaning that different ML classifiers are more consistent in assigning the same classifications to any given abstract, compared to different human classifiers. While the top five percentile of human classifiers can outperform ML in limited cases, selection and training of such classifiers is likely costly and difficult compared to training ML models. Our results suggest ML models are a cost effective and highly accurate method for addressing problems in comparative bibliometric analysis, such as harmonising the discipline classifications of research from different funding agencies or countries.

Funder

National Research Foundation Singapore

Publisher

Springer Science and Business Media LLC

Subject

Library and Information Sciences,Computer Science Applications,General Social Sciences

Link

https://link.springer.com/content/pdf/10.1007/s11192-020-03614-2.pdf

Reference36 articles.

1. Aggarwal, C. C., & Zhai, C. (2012). A survey of text classification algorithms. In C. Aggarwal & C. Zhai (Eds.), Mining text data. Boston, MA: Springer. https://doi.org/10.1007/978-1-4614-3223-4_6.

2. Baeza-Yates, R., & Ribeiro-Neto, B. (1999). Modern information retrieval. New York, NY: ACM Press.

3. Bergstra, J., & Bengio, Y. (2012). Random search for hyper-parameter optimization. Journal of Machine Learning Research, 13, 281–305.

4. Braam, R. R., Moed, H. F., & Van Raan, A. F. (1991a). Mapping of science by combined co-citation and word analysis. I. Structural aspects. Journal of the American Society for information science, 42(4), 233–251.

5. Braam, R. R., Moed, H. F., & Van Raan, A. F. (1991b). Mapping of science by combined co-citation and word analysis. II: Dynamical aspects. Journal of the American Society for information science, 42(4), 252–266.

Cited by 22 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Using Natural Language Processing to Identify Stigmatizing Language in Labor and Birth Clinical Notes;Maternal and Child Health Journal;2023-12-26

2. Discovering Process Dynamics for Scalable Perovskite Solar Cell Manufacturing with Explainable AI;Advanced Materials;2023-12-07

3. A transfer learning approach to interdisciplinary document classification with keyword-based explanation;Scientometrics;2023-11-04

4. Characteristics of LIS research articles affecting their citation impact;Journal of Librarianship and Information Science;2023-09-05

5. Perspectives on Large Language Models for Relevance Judgment;Proceedings of the 2023 ACM SIGIR International Conference on Theory of Information Retrieval;2023-08-09