Does ChatGPT Answer Otolaryngology Questions Accurately?

Authors:

Matthew Maksimoski (1, 2); Anisha Rhea Noble (1, 2); David F. Smith (1, 2, 3)

Affiliation:

1. Division of Pediatric Otolaryngology, Cincinnati Children's Hospital Medical Center, Cincinnati, Ohio, U.S.A.

2. Department of Otolaryngology - Head and Neck Surgery, University of Cincinnati, 231 Albert Sabin Way, Cincinnati, Ohio, U.S.A.

3. Division of Sleep and Circadian Medicine, Cincinnati Children's Hospital Medical Center, Cincinnati, Ohio, U.S.A.

Abstract

Objective: To investigate the accuracy of ChatGPT in answering medical questions related to otolaryngology.

Methods: A ChatGPT session was opened in which 93 questions on otolaryngology topics were asked. Questions were drawn from all major domains within otolaryngology and were based on key action statements (KAS) from clinical practice guidelines (CPGs). Twenty-one "patient-level" questions were also posed to the program. Answers were graded as "correct," "partially correct," "incorrect," or "non-answer."

Results: Correct answers were given at a rate of 45.5% (71.4% patient-level, 37.3% CPG); partially correct answers at 31.8% (28.6% patient-level, 32.8% CPG); incorrect answers at 21.6% (0% patient-level, 28.4% CPG); and non-answers at 1.1% (0% patient-level, 1.5% CPG). There was no difference in the rate of correct answers between CPGs published before or after the period of data collection cited by ChatGPT. CPG-based questions were less likely to be answered correctly than patient-level questions (p = 0.003).

Conclusion: Publicly available artificial intelligence software has become increasingly popular with consumers for everything from storytelling to data collection. In this study, we examined the accuracy of ChatGPT responses to questions related to otolaryngology across 7 domains and 21 published CPGs. Physicians and patients should understand the limitations of this software as it applies to otolaryngology, and programmers of future iterations should consider giving greater weight to information published in well-established journals and written by national content experts.

Level of Evidence: N/A. Laryngoscope, 134:4011-4015, 2024.

Publisher

Wiley


Cited by 3 articles.
