Performance and Consistency of ChatGPT‐4 Versus Otolaryngologists: A Clinical Case Series

Author:

Lechien Jérôme R.1234,Naunheim Mattheuw R.15,Maniaci Antonino16,Radulesco Thomas17,Saibene Alberto M.18,Chiesa‐Estomba Carlos M.19,Vaira Luigi A.11011

Affiliation:

1. Research Committee of Young Otolaryngologists of the International Federation of Otorhinolaryngological Societies (IFOS) Paris France

2. Division of Laryngology and Broncho‐Esophagology, Department of Otolaryngology–Head Neck Surgery, EpiCURA Hospital, UMONS Research Institute for Health Sciences and Technology University of Mons (UMons) Mons Belgium

3. Department of Otorhinolaryngology and Head and Neck Surgery, Foch Hospital, Phonetics and Phonology Laboratory (UMR 7018 CNRS, Université Sorbonne Nouvelle/Paris 3) Paris Saclay University Paris France

4. Department of Otorhinolaryngology and Head and Neck Surgery CHU Saint‐Pierre Brussels Belgium

5. Department of Otolaryngology, Massachusetts Eye and Ear Harvard Medical School Boston Massachusetts USA

6. Department of medicine and surgery, Faculty of Medicine and Surgery University of Enna “Kore” Enna Italy

7. ENT‐HNS Department, APHM, CNRS, IUSTI, La Conception University Hospital Aix Marseille Univ Marseille France

8. Otolaryngology Unit, Department of Health Sciences, ASST Santi Paolo E Carlo Università Degli Studi Di Milano Milan Italy

9. Department of Otorhinolaryngology–Head and Neck Surgery Hospital Universitario Donostia San Sebastian Spain

10. Maxillofacial Surgery Operative Unit, Department of Medicine, Surgery and Pharmacy University of Sassari Sassari Italy

11. Department of Biomedical Sciences, PhD School of Biomedical Sciences University of Sassari Sassari Italy

Abstract

AbstractObjectiveTo study the performance of Chatbot Generative Pretrained Transformer‐4 (ChatGPT‐4) in the management of cases in otolaryngology–head and neck surgery.Study DesignProspective case series.SettingMulticenter University Hospitals.MethodsHistory, clinical, physical, and additional examinations of adult outpatients consulting in otolaryngology departments of CHU Saint‐Pierre and Dour Medical Center were presented to ChatGPT‐4, which was interrogated for differential diagnoses, management, and treatment(s). According to specialty, the ChatGPT‐4 responses were assessed by 2 distinct, blinded board‐certified otolaryngologists with the Artificial Intelligence Performance Instrument.ResultsOne hundred cases were presented to ChatGPT‐4. ChaGPT‐4 indicated a mean of 3.34 (95% confidence interval [CI]: 3.09, 3.59) additional examinations per patient versus 2.10 (95% CI: 1.76, 2.34; P = .001) for the practitioners. There was strong consistency (k > 0.600) between otolaryngologists and ChatGPT‐4 for the indication of upper aerodigestive tract endoscopy, positron emission tomography and computed tomography, audiometry, tympanometry, and psychophysical evaluations. Primary diagnosis was correctly performed by ChatGPT‐4 in 38% to 86% of cases depending on subspecialty. Additional examinations indicated by ChatGPT‐4 were pertinent and necessary in 8% to 31% of cases, while the treatment regimen was pertinent in 12% to 44% of cases. The performance of ChatGPT‐4 was not influenced by the human‐reported level of difficulty of clinical cases.ConclusionChatGPT‐4 may be a promising adjunctive tool in otolaryngology, providing extensive documentation about additional examinations, primary and differential diagnoses, and treatments. The ChatGPT‐4 is more effective in providing a primary diagnosis, and less effective in the selection of additional examinations and treatments.

Publisher

Wiley

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3