Abstract
Abstract
Purpose
To evaluate the ability of ChatGPT-4 to generate a biomedical review article on fertility preservation.
Methods
ChatGPT-4 was prompted to create an outline for a review on fertility preservation in men and prepubertal boys. The outline provided by ChatGPT-4 was subsequently used to prompt ChatGPT-4 to write the different parts of the review and provide five references for each section. The different parts of the article and the references provided were combined to create a single scientific review that was evaluated by the authors, who are experts in fertility preservation. The experts assessed the article and the references for accuracy and checked for plagiarism using online tools. In addition, both experts independently scored the relevance, depth, and currentness of the ChatGPT-4’s article using a scoring matrix ranging from 0 to 5 where higher scores indicate higher quality.
Results
ChatGPT-4 successfully generated a relevant scientific article with references. Among 27 statements needing citations, four were inaccurate. Of 25 references, 36% were accurate, 48% had correct titles but other errors, and 16% were completely fabricated. Plagiarism was minimal (mean = 3%). Experts rated the article’s relevance highly (5/5) but gave lower scores for depth (2–3/5) and currentness (3/5).
Conclusion
ChatGPT-4 can produce a scientific review on fertility preservation with minimal plagiarism. While precise in content, it showed factual and contextual inaccuracies and inconsistent reference reliability. These issues limit ChatGPT-4 as a sole tool for scientific writing but suggest its potential as an aid in the writing process.
Funder
National Institute of Child Health and Human Development
Publisher
Springer Science and Business Media LLC
Reference36 articles.
1. Roose K. The brilliance and weirdness of ChatGPT. 2023; Available from: https://www.nytimes.com/2022/12/05/technology/chatgpt-ai-twitter.html.
2. Stokel-Walker C. ChatGPT listed as author on research papers: many scientists disapprove. Nature 2013; Available from: https://www.nature.com/articles/d41586-023-00107-z.
3. Gao CA, Howard FM, Markov NS, Dyer EC, Ramesh S, Luo Y, Pearson AT. Comparing scientific abstracts generated by ChatGPT to original abstracts using an artificial intelligence output detector, plagiarism detector, and blinded human reviewers. bioRxiv. 2022;2022.12.23.521610.
4. Levin G, Meyer R, Kadoch E, Brezinov Y. Identifying ChatGPT-Written OBGYN abstracts using a simple tool. Am J Obstet Gynecol MFM. 2023;5(6):100936.
5. Else H. Abstracts written by ChatGPT fool scientists. Nature. 2023;613(7944):423.
Cited by
2 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献