Abstract
To build a full picture of previous studies on the origins of SARS-CoV-2 (severe acute respiratory syndrome coronavirus 2), this paper exploits an active learning-based approach to screen scholarly articles about the origins of SARS-CoV-2 from many scientific publications. In more detail, six seed articles were utilized to manually curate 170 relevant articles and 300 nonrelevant articles. Then, an active learning-based approach with three query strategies and three base classifiers is trained to screen the articles about the origins of SARS-CoV-2. Extensive experimental results show that our active learning-based approach outperforms traditional counterparts, and the uncertain sampling query strategy performs best among the three strategies. By manually checking the top 1,000 articles of each base classifier, we ultimately screened 715 unique scholarly articles to create a publicly available peer-reviewed literature corpus, COVID-Origin. This indicates that our approach for screening articles about the origins of SARS-CoV-2 is feasible.
Funder
National Natural Science Foundation of China
Publisher
Public Library of Science (PLoS)
Reference74 articles.
1. A new coronavirus associated with human respiratory disease in China;F Wu;Nature,2010
2. Opinion: To Stop the next pandemic, we need to unravel the origins of COVID-19;DA Relman;Proceedings of the National Academy of Sciences of the United States of America,2020
3. Serological Evidence of Bat SARS-related Coronavirus Infection in Humans, China.;N Wang;Virologica Sinica.,2018
4. Review of Ebola virus infections in domestic animals.;HM Weingartl;Developments in Biologicals.,2013
5. Publishing volumes in major databases related to Covid-19;J. A. T da Silva;Scientometrics,2021
Cited by
3 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献