Early identification of scientific breakthroughs through outlier analysis based on research entities
Author:
Zhao Yang1, Zhang Mengting12, Chen Xiaoli1, Zhang Zhixiong12
Affiliation:
1. National Science Library, Chinese Academy of Sciences , Beijing , China 2. Department of Information Resources Management, School of Economics and Management, University of Chinese Academy of Sciences , Beijing , China
Abstract
Abstract
Purpose
To address the “anomalies” that occur when scientific breakthroughs emerge, this study focuses on identifying early signs and nascent stages of breakthrough innovations from the perspective of outliers, aiming to achieve early identification of scientific breakthroughs in papers.
Design/methodology/approach
This study utilizes semantic technology to extract research entities from the titles and abstracts of papers to represent each paper’s research content. Outlier detection methods are then employed to measure and analyze the anomalies in breakthrough papers during their early stages. The development and evolution process are traced using literature time tags. Finally, a case study is conducted using the key publications of the 2021 Nobel Prize laureates in Physiology or Medicine.
Findings
Through manual analysis of all identified outlier papers, the effectiveness of the proposed method for early identifying potential scientific breakthroughs is verified.
Research limitations
The study’s applicability has only been empirically tested in the biomedical field. More data from various fields are needed to validate the robustness and generalizability of the method.
Practical implications
This study provides a valuable supplement to current methods for early identification of scientific breakthroughs, effectively supporting technological intelligence decision-making and services.
Originality/Value
The study introduces a novel approach to early identification of scientific breakthroughs by leveraging outlier analysis of research entities, offering a more sensitive, precise, and fine-grained alternative method compared to traditional citation-based evaluations, which enhances the ability to identify nascent breakthrough innovations.
Publisher
Walter de Gruyter GmbH
Reference16 articles.
1. Bansal, R., Gaur, N., & Singh, S. N. (2016). Outlier detection: applications and techniques in data mining. 2016 6th International conference-cloud system and big data engineering (Confluence), Noida, India, 373–377, doi: 10.1109/CONFLUENCE.2016.7508146 2. Chen, X., & Han, T. (2021). Early Detection of Emerging Technologies using Temporal Features. Predicting the Dynamics of Research Impact, 253–269. 3. Gu, Y., Tinn, R., Cheng, H., Lucas, M., Usuyama, N., Liu, X., Naumann, T., Gao, J., & Poon, H. (2021). Domain-specific language model pretraining for biomedical natural language processing. ACM Transactions on Computing for Healthcare (HEALTH), 3(1), 1–23. 4. Hollingsworth, J. R. (2008). Scientific discoveries: An institutionalist and path-dependent perspective. Biomedical and Health Research-Commission of the European Communities Then IOS Press, 72, 317. 5. Kuhn, T. S. (1997). The structure of scientific revolutions (Vol. 962). University of Chicago Press.
|
|