Examining the Validity of ChatGPT in Identifying Relevant Nephrology Literature: Findings and Implications-Reference-Cited by-同舟云学术

Examining the Validity of ChatGPT in Identifying Relevant Nephrology Literature: Findings and Implications

Published:2023-08-25 Issue:17 Volume:12 Page:5550
ISSN:2077-0383
Container-title:Journal of Clinical Medicine
language:en
Short-container-title:JCM

Author:

Suppadungsuk Supawadee¹²^ORCID,Thongprayoon Charat¹,Krisanapan Pajaree¹³^ORCID,Tangpanithandee Supawit¹²^ORCID,Garcia Valencia Oscar¹^ORCID,Miao Jing¹^ORCID,Mekraksakit Poemlarp¹^ORCID,Kashani Kianoush¹^ORCID,Cheungpasitporn Wisit¹^ORCID

Affiliation:

1. Division of Nephrology and Hypertension, Department of Medicine, Mayo Clinic, Rochester, MN 55905, USA

2. Chakri Naruebodindra Medical Institute, Faculty of Medicine Ramathibodi Hospital, Mahidol University, Samut Prakan 10540, Thailand

3. Division of Nephrology, Thammasat University Hospital, Pathum Thani 12120, Thailand

Abstract

Literature reviews are valuable for summarizing and evaluating the available evidence in various medical fields, including nephrology. However, identifying and exploring the potential sources requires focus and time devoted to literature searching for clinicians and researchers. ChatGPT is a novel artificial intelligence (AI) large language model (LLM) renowned for its exceptional ability to generate human-like responses across various tasks. However, whether ChatGPT can effectively assist medical professionals in identifying relevant literature is unclear. Therefore, this study aimed to assess the effectiveness of ChatGPT in identifying references to literature reviews in nephrology. We keyed the prompt “Please provide the references in Vancouver style and their links in recent literature on… name of the topic” into ChatGPT-3.5 (03/23 Version). We selected all the results provided by ChatGPT and assessed them for existence, relevance, and author/link correctness. We recorded each resource’s citations, authors, title, journal name, publication year, digital object identifier (DOI), and link. The relevance and correctness of each resource were verified by searching on Google Scholar. Of the total 610 references in the nephrology literature, only 378 (62%) of the references provided by ChatGPT existed, while 31% were fabricated, and 7% of citations were incomplete references. Notably, only 122 (20%) of references were authentic. Additionally, 256 (68%) of the links in the references were found to be incorrect, and the DOI was inaccurate in 206 (54%) of the references. Moreover, among those with a link provided, the link was correct in only 20% of cases, and 3% of the references were irrelevant. Notably, an analysis of specific topics in electrolyte, hemodialysis, and kidney stones found that >60% of the references were inaccurate or misleading, with less reliable authorship and links provided by ChatGPT. Based on our findings, the use of ChatGPT as a sole resource for identifying references to literature reviews in nephrology is not recommended. Future studies could explore ways to improve AI language models’ performance in identifying relevant nephrology literature.

Publisher

MDPI AG

Subject

General Medicine

Link

https://www.mdpi.com/2077-0383/12/17/5550/pdf

Reference45 articles.

1. A beginner’s guide to the literature search in medical education;Martin;Scott. Med. J.,2017

2. Literature and medicine: A problem of assessment;Kuper;Acad. Med.,2006

3. Literature search for research planning and identification of research problem;Grewal;Indian J. Anaesth.,2016

4. The Benefits and Challenges of ChatGPT: An Overview;Deng;Front. Comput. Intell. Syst.,2022

5. ChatGPT: Five priorities for research;Bollen;Nature,2023

Cited by 26 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Large Language Models in Biomedical and Health Informatics: A Review with Bibliometric Analysis;Journal of Healthcare Informatics Research;2024-09-14

2. AI-Driven Patient Education in Chronic Kidney Disease: Evaluating Chatbot Responses against Clinical Guidelines;Diseases;2024-08-16

3. Reference Hallucination Score for Medical Artificial Intelligence Chatbots: Development and Usability Study;JMIR Medical Informatics;2024-07-31

4. Readability analysis of ChatGPT's responses on lung cancer;Scientific Reports;2024-07-26

5. The potential of ChatGPT in medicine: an example analysis of nephrology specialty exams in Poland;Clinical Kidney Journal;2024-06-22