Learning to fake it: limited responses and fabricated references provided by ChatGPT for medical questions

Published: 2023-03-24

Author:

Gravel Jocelyn, D’Amours-Gravel Madeleine, Osmanlliu Esli

Abstract

Background: ChatGPT has gained public notoriety and has recently supported manuscript preparation. Our objective was to evaluate the quality of the answers and the references provided by ChatGPT for medical questions.

Methods: Three researchers asked ChatGPT a total of 20 medical questions and prompted it to provide the corresponding references. The responses were evaluated for quality of content by medical experts using a verbal numeric scale ranging from 0 to 100%. These experts were the corresponding authors of the 20 articles from which the medical questions were derived. We planned to evaluate three references per response for their pertinence, but this was amended based on preliminary results showing that most references provided by ChatGPT were fabricated.

Results: ChatGPT provided responses ranging from 53 to 244 words in length and reported two to seven references per answer. Seventeen of the 20 invited raters provided feedback. The raters reported limited quality of the responses, with a median score of 60% (1st and 3rd quartiles: 50% and 85%). Additionally, they identified major (n=5) and minor (n=7) factual errors among the 17 evaluated responses. Of the 59 references evaluated, 41 (69%) were fabricated, though they appeared real. Most fabricated citations used names of authors with previous relevant publications, a title that seemed pertinent, and a credible journal format.

Interpretation: When asked multiple medical questions, ChatGPT provided answers of limited quality for scientific publication. More importantly, ChatGPT provided deceptively real references. Users of ChatGPT should pay particular attention to the references provided before integrating them into medical manuscripts.

Publisher

Cold Spring Harbor Laboratory

References: 43 articles.

1. Kitamura FC. ChatGPT Is Shaping the Future of Medical Writing but Still Requires Human Judgment. Radiology. 2023:230171.

2. ChatGPT: Optimizing language models for dialogue. OpenAI; [updated November 30, 2022]. Available from: https://openai.com/blog/chatgpt/.

3. Biswas S. ChatGPT and the Future of Medical Writing. Radiology. 2023:223312.

4. Open artificial intelligence platforms in nursing education: Tools for academic progress or abuse?

5. Rapamycin in the context of Pascal’s Wager: generative pre-trained transformer perspective

Cited by 11 articles.

1. ChatGPT and the Technology-Education Tension: Applying Contextual Virtue Epistemology to a Cognitive Artifact;Philosophy & Technology;2024-01-27

2. Muzzling Misinformation: Drawing from Other Disciplines and Engaging Health and Science Journalists as Research Collaborators;Palgrave Handbook of Science and Health Journalism;2024

3. The risks of using ChatGPT to obtain common safety-related information and advice;Safety Science;2023-11

4. Assessing the research landscape and clinical utility of large language models: A scoping review;2023-10-30

5. Hallucination Rates and Reference Accuracy of ChatGPT and Bard for Systematic Reviews: Comparative Analysis (Preprint);2023-09-27