Affiliation:
1. Sano Centre for Computer Medicine
2. Research Center for Knowledge Engineering, Zhejiang Lab
Abstract
ChatGPT is becoming a new reality. In this paper, we show how to distinguish ChatGPT-generated publications from those produced by scientists, using a newly designed supervised machine learning algorithm, xFakeBibs. The algorithm was trained on 100 real publications and calibrated using 10 folds of real publications. When comparing the training set with the calibration folds, we found that the similarities fluctuated between 19% and 21% of bigram overlaps. The calibrating folds each contributed between 51% and 70% new bigrams, while ChatGPT contributed only 23%, which is less than 50% of the contribution of any of the 10 calibrating folds. When classifying the individual articles, the xFakeBibs algorithm predicted 98 of the 100 ChatGPT-generated publications as fake, while 2 articles failed the test and were classified as real publications. We introduced an algorithmic approach that detected the ChatGPT-generated articles with a high degree of accuracy. However, it remains challenging to detect all fake records. This work is a step in the right direction to counter fake science and misinformation.
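The bigram comparison described in the abstract can be illustrated with a minimal sketch. This is not the xFakeBibs algorithm itself, whose details are not given here. It assumes that a bigram is a pair of adjacent lowercase word tokens and that "overlap" and "new bigrams" are measured as shares of a candidate set's distinct bigrams. The function names `bigrams` and `overlap_and_new` are hypothetical.

```python
import re


def bigrams(text):
    """Return the set of adjacent lowercase word pairs in `text`."""
    words = re.findall(r"[a-z0-9]+", text.lower())
    return set(zip(words, words[1:]))


def overlap_and_new(training_texts, candidate_texts):
    """Compare candidate bigrams against a training bigram set.

    Assumed definitions (not taken from the paper):
      overlap ratio = share of distinct candidate bigrams already seen in training
      new ratio     = share of distinct candidate bigrams absent from training
    """
    train = set()
    for text in training_texts:
        train |= bigrams(text)

    candidate = set()
    for text in candidate_texts:
        candidate |= bigrams(text)

    if not candidate:
        return 0.0, 0.0

    shared = len(candidate & train)
    total = len(candidate)
    return shared / total, (total - shared) / total


if __name__ == "__main__":
    training = ["deep learning detects fake science"]
    fold = ["deep learning detects generated text"]
    overlap, new = overlap_and_new(training, fold)
    print(f"overlap={overlap:.2f} new={new:.2f}")
```

Under these assumptions, a set of texts that contributes far fewer new bigrams than any calibration fold of real publications would stand out, which is the kind of signal the abstract reports for ChatGPT-generated articles.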
Publisher
Research Square Platform LLC
Cited by
3 articles.