Evaluating ChatGPT-4’s performance as a digital health advisor for otosclerosis surgery-Reference-Cited by-同舟云学术

Evaluating ChatGPT-4’s performance as a digital health advisor for otosclerosis surgery

Published:2024-06-05 Issue: Volume:11 Page:
ISSN:2296-875X
Container-title:Frontiers in Surgery
language:
Short-container-title:Front. Surg.

Author:

Sahin Samil,Erkmen Burak,Duymaz Yaşar Kemal,Bayram Furkan,Tekin Ahmet Mahmut,Topsakal Vedat

Abstract

PurposeThis study aims to evaluate the effectiveness of ChatGPT-4, an artificial intelligence (AI) chatbot, in providing accurate and comprehensible information to patients regarding otosclerosis surgery.MethodsOn October 20, 2023, 15 hypothetical questions were posed to ChatGPT-4 to simulate physician-patient interactions about otosclerosis surgery. Responses were evaluated by three independent ENT specialists using the DISCERN scoring system. The readability was evaluated using multiple indices: Flesch Reading Ease (FRE), Flesch-Kincaid Grade Level (FKGL), Gunning Fog Index (Gunning FOG), Simple Measure of Gobbledygook (SMOG), Coleman-Liau Index (CLI), and Automated Readability Index (ARI).ResultsThe responses from ChatGPT-4 received DISCERN scores ranging from poor to excellent, with an overall score of 50.7 ± 8.2. The readability analysis indicated that the texts were above the 6th-grade level, suggesting they may not be easily comprehensible to the average reader. There was a significant positive correlation between the referees’ scores. Despite providing correct information in over 90% of the cases, the study highlights concerns regarding the potential for incomplete or misleading answers and the high readability level of the responses.ConclusionWhile ChatGPT-4 shows potential in delivering health information accurately, its utility is limited by the level of readability of its responses. The study underscores the need for continuous improvement in AI systems to ensure the delivery of information that is both accurate and accessible to patients with varying levels of health literacy. Healthcare professionals should supervise the use of such technologies to enhance patient education and care.

Publisher

Frontiers Media SA

Reference27 articles.

1. Comprehensiveness of online sources for patient education on hereditary hearing impairment;Duymaz;Front Pediatr,2023

2. Applicability of ChatGPT in assisting to solve higher order problems in pathology;Sinha;Cureus,2023

3. The role of ChatGPT, generative language models, and artificial intelligence in medical education: a conversation with ChatGPT and a call for papers;Eysenbach;JMIR Med Educ,2023

4. An overview of the etiology of otosclerosis;Markou;Eur Arch Oto-Rhino-Laryngol,2009

5. Otosclerosis: an update on diagnosis and treatment;Batson;J Am Acad Physician Assist,2017

Cited by 2 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Assessment of readability, reliability, and quality of ChatGPT®, BARD®, Gemini®, Copilot®, Perplexity® responses on palliative care;Medicine;2024-08-16

2. Letter to the Editor: Comment on: “Reliability of artificial intelligence chatbot responses to frequently asked questions in breast surgical oncology”;Journal of Surgical Oncology;2024-07-17