Comparing ChatGPT's and Surgeon's Responses to Thyroid-related Questions From Patients

Author:

Guo Siyin1, Li Ruicen2, Li Genpeng1, Chen Wenjie1, Huang Jing1, He Linye1, Ma Yu1, Wang Liying1, Zheng Hongping3, Tian Chunxiang4, Zhao Yatong5, Pan Xinmin6, Wan Hongxing7, Liu Dasheng8, Li Zhihui1, Lei Jianyong1

Affiliation:

1. Division of Thyroid Surgery, Department of General Surgery, West China Hospital, Sichuan University, Chengdu, Sichuan 610041, China

2. Health Management Center, General Practice Medical Center, West China Hospital, Sichuan University, Chengdu, Sichuan 610041, China

3. Department of Thyroid Surgery, General Surgery Ward 7, The First Hospital of Lanzhou University, Lanzhou, Gansu 730000, China

4. Chengdu Women’s and Children’s Central Hospital, School of Medicine, University of Electronic Science and Technology of China, Chengdu, Sichuan 610031, China

5. Thyroid Surgery, Zhengzhou Central Hospital Affiliated of Zhengzhou University, Zhengzhou, Henan 450007, China

6. Department of Thyroid Surgery, General Surgery III, Gansu Provincial Hospital, Lanzhou, Gansu 730000, China

7. Department of Oncology, Sanya People’s Hospital, Sanya, Hainan 572000, China

8. Department of Vascular Thyroid Surgery, The Second Affiliated Hospital of Guangzhou University of Chinese Medicine, Guangzhou, Guangdong 510120, China

Abstract

Context: For some common thyroid-related conditions with high prevalence and long follow-up times, ChatGPT can be used to respond to common thyroid-related questions.

Objective: In this cross-sectional study, we assessed the ability of ChatGPT (version GPT-4.0) to provide accurate, comprehensive, compassionate, and satisfactory responses to common thyroid-related questions.

Methods: First, we obtained 28 thyroid-related questions from the Huayitong app, which, together with 2 interfering questions, formed a set of 30 questions. These questions were then answered separately by ChatGPT (on July 19, 2023), a junior specialist, and a senior specialist (on July 20, 2023). Finally, 26 patients and 11 thyroid surgeons evaluated the responses on 4 dimensions: accuracy, comprehensiveness, compassion, and satisfaction.

Results: Across the 30 questions and responses, ChatGPT responded faster than the junior specialist (8.69 [7.53-9.48] vs 4.33 [4.05-4.60]; P < .001) and the senior specialist (8.69 [7.53-9.48] vs 4.22 [3.36-4.76]; P < .001). The word count of ChatGPT's responses was greater than that of both the junior specialist (341.50 [301.00-384.25] vs 74.50 [51.75-84.75]; P < .001) and the senior specialist (341.50 [301.00-384.25] vs 104.00 [63.75-177.75]; P < .001). ChatGPT received higher scores than the junior specialist and the senior specialist in accuracy, comprehensiveness, compassion, and satisfaction when responding to common thyroid-related questions.

Conclusion: ChatGPT performed better than a junior specialist and a senior specialist in answering common thyroid-related questions, but further research is needed to validate the logical ability of ChatGPT for complex thyroid questions.

Publisher

The Endocrine Society
