Performance and Consistency of ChatGPT‐4 Versus Otolaryngologists: A Clinical Case Series-Reference-Cited by-同舟云学术

Performance and Consistency of ChatGPT‐4 Versus Otolaryngologists: A Clinical Case Series

Published:2024-04-09 Issue:6 Volume:170 Page:1519-1526
ISSN:0194-5998
Container-title:Otolaryngology–Head and Neck Surgery
language:en
Short-container-title:Otolaryngol.--head neck surg.

Author:

Lechien Jérôme R.¹²³⁴,Naunheim Mattheuw R.¹⁵,Maniaci Antonino¹⁶,Radulesco Thomas¹⁷,Saibene Alberto M.¹⁸,Chiesa‐Estomba Carlos M.¹⁹,Vaira Luigi A.¹¹⁰¹¹

Affiliation:

1. Research Committee of Young Otolaryngologists of the International Federation of Otorhinolaryngological Societies (IFOS) Paris France

2. Division of Laryngology and Broncho‐Esophagology, Department of Otolaryngology–Head Neck Surgery, EpiCURA Hospital, UMONS Research Institute for Health Sciences and Technology University of Mons (UMons) Mons Belgium

3. Department of Otorhinolaryngology and Head and Neck Surgery, Foch Hospital, Phonetics and Phonology Laboratory (UMR 7018 CNRS, Université Sorbonne Nouvelle/Paris 3) Paris Saclay University Paris France

4. Department of Otorhinolaryngology and Head and Neck Surgery CHU Saint‐Pierre Brussels Belgium

5. Department of Otolaryngology, Massachusetts Eye and Ear Harvard Medical School Boston Massachusetts USA

6. Department of medicine and surgery, Faculty of Medicine and Surgery University of Enna “Kore” Enna Italy

7. ENT‐HNS Department, APHM, CNRS, IUSTI, La Conception University Hospital Aix Marseille Univ Marseille France

8. Otolaryngology Unit, Department of Health Sciences, ASST Santi Paolo E Carlo Università Degli Studi Di Milano Milan Italy

9. Department of Otorhinolaryngology–Head and Neck Surgery Hospital Universitario Donostia San Sebastian Spain

10. Maxillofacial Surgery Operative Unit, Department of Medicine, Surgery and Pharmacy University of Sassari Sassari Italy

11. Department of Biomedical Sciences, PhD School of Biomedical Sciences University of Sassari Sassari Italy

Abstract

AbstractObjectiveTo study the performance of Chatbot Generative Pretrained Transformer‐4 (ChatGPT‐4) in the management of cases in otolaryngology–head and neck surgery.Study DesignProspective case series.SettingMulticenter University Hospitals.MethodsHistory, clinical, physical, and additional examinations of adult outpatients consulting in otolaryngology departments of CHU Saint‐Pierre and Dour Medical Center were presented to ChatGPT‐4, which was interrogated for differential diagnoses, management, and treatment(s). According to specialty, the ChatGPT‐4 responses were assessed by 2 distinct, blinded board‐certified otolaryngologists with the Artificial Intelligence Performance Instrument.ResultsOne hundred cases were presented to ChatGPT‐4. ChaGPT‐4 indicated a mean of 3.34 (95% confidence interval [CI]: 3.09, 3.59) additional examinations per patient versus 2.10 (95% CI: 1.76, 2.34; P = .001) for the practitioners. There was strong consistency (k > 0.600) between otolaryngologists and ChatGPT‐4 for the indication of upper aerodigestive tract endoscopy, positron emission tomography and computed tomography, audiometry, tympanometry, and psychophysical evaluations. Primary diagnosis was correctly performed by ChatGPT‐4 in 38% to 86% of cases depending on subspecialty. Additional examinations indicated by ChatGPT‐4 were pertinent and necessary in 8% to 31% of cases, while the treatment regimen was pertinent in 12% to 44% of cases. The performance of ChatGPT‐4 was not influenced by the human‐reported level of difficulty of clinical cases.ConclusionChatGPT‐4 may be a promising adjunctive tool in otolaryngology, providing extensive documentation about additional examinations, primary and differential diagnoses, and treatments. The ChatGPT‐4 is more effective in providing a primary diagnosis, and less effective in the selection of additional examinations and treatments.

Publisher

Wiley

Reference19 articles.

1. Accuracy of ChatGPT‐Generated Information on Head and Neck and Oromaxillofacial Surgery: A Multicenter Collaborative Analysis

2. Validity and reliability of an instrument evaluating the performance of intelligent chatbot: the Artificial Intelligence Performance Instrument (AIPI)

3. Is ChatGPT‐4 Accurate in Proofread a Manuscript in Otolaryngology–Head and Neck Surgery?

4. ChatGPT performance in laryngology and head and neck surgery: a clinical case-series

5. The Strengthening the Reporting of Observational Studies in Epidemiology (STROBE) statement: guidelines for reporting observational studies

Cited by 10 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Evaluation of Vertigo-Related Information from Artificial Intelligence Chatbot;2024-09-02

2. Enhancing AI Chatbot Responses in Healthcare: The SMART Prompt Structure in Head and Neck Surgery;2024-08-23

3. Investigating the role of artificial intelligence in predicting perceived dysphonia level;European Archives of Oto-Rhino-Laryngology;2024-08-22

4. Generative Large Language Models in Electronic Health Records for Patient Care Since 2023: A Systematic Review;2024-08-12

5. ChatGPT‐4 Consistency in Interpreting Laryngeal Clinical Images of Common Lesions and Disorders;Otolaryngology–Head and Neck Surgery;2024-07-24