Accuracy of Information given by ChatGPT for Patients with Inflammatory Bowel Disease in Relation to ECCO Guidelines

Author:

Sciberras Martina1ORCID,Farrugia Yvette1,Gordon Hannah23,Furfaro Federica4,Allocca Mariangela4ORCID,Torres Joana567ORCID,Arebi Naila89,Fiorino Gionata410ORCID,Iacucci Marietta11ORCID,Verstockt Bram1213ORCID,Magro Fernando14,Katsanos Kostas15,Busuttil Josef16,De Giovanni Katya16,Fenech Valerie Anne1,Chetcuti Zammit Stefania1,Ellul Pierre1

Affiliation:

1. Department of Medicine, Division of Gastroenterology, Mater Dei Hospital , Msida , Malta

2. Department of Gastroenterology, Barts Health NHS Trust , London , UK

3. Translational Gastroenterology and Liver Unit, John Radcliffe Hospital, University of Oxford , Oxford , UK

4. IRCCS OSPEDALE San Raffaele, Gastroenterology and Endoscopy, IBD Center , Milan , Italy

5. Division of Gastroenterology, Hospital da Luz , Lisbon , Portugal

6. Division of Gastroenterology, Hospital Beatriz Ângelo , Loures , Portugal

7. Faculdade de Medicina, Universidade de Lisboa , Lisbon , Portugal

8. Department of Inflammatory Bowel Disease, St Mark’s National Bowel Hospital , London , UK

9. Department of Metabolism, Digestion and Reproduction, Imperial College London , London , UK

10. IBD Unit, San Camillo-Forlanini Hospital , Rome , Italy

11. APC Microbiome Ireland, College of Medicine and Health, University College of Cork , Cork , Ireland

12. Department of Gastroenterology and Hepatology, University Hospitals Leuven, KU Leuven , Leuven , Belgium

13. Department of Chronic Diseases and Metabolism, KU Leuven , Leuven , Belgium

14. CINTESIS@RISE, Faculty of Medicine of the University of Porto , Porto , Portugal

15. Division of Gastroenterology, Department of Internal Medicine, Faculty of Medicine, University of Ioannina School of Health Sciences , Ioannina , Greece

16. Association for Crohn`s and Colitis , Malta

Abstract

Abstract Background As acceptance of artificial intelligence [AI] platforms increases, more patients will consider these tools as sources of information. The ChatGPT architecture utilizes a neural network to process natural language, thus generating responses based on the context of input text. The accuracy and completeness of ChatGPT3.5 in the context of inflammatory bowel disease [IBD] remains unclear. Methods In this prospective study, 38 questions worded by IBD patients were inputted into ChatGPT3.5. The following topics were covered: [1] Crohn’s disease [CD], ulcerative colitis [UC], and malignancy; [2] maternal medicine; [3] infection and vaccination; and [4] complementary medicine. Responses given by ChatGPT were assessed for accuracy [1—completely incorrect to 5—completely correct] and completeness [3-point Likert scale; range 1—incomplete to 3—complete] by 14 expert gastroenterologists, in comparison with relevant ECCO guidelines. Results In terms of accuracy, most replies [84.2%] had a median score of ≥4 (interquartile range [IQR]: 2) and a mean score of 3.87 [SD: ±0.6]. For completeness, 34.2% of the replies had a median score of 3 and 55.3% had a median score of between 2 and <3. Overall, the mean rating was 2.24 [SD: ±0.4, median: 2, IQR: 1]. Though groups 3 and 4 had a higher mean for both accuracy and completeness, there was no significant scoring variation between the four question groups [Kruskal–Wallis test p > 0.05]. However, statistical analysis for the different individual questions revealed a significant difference for both accuracy [p < 0.001] and completeness [p < 0.001]. The questions which rated the highest for both accuracy and completeness were related to smoking, while the lowest rating was related to screening for malignancy and vaccinations especially in the context of immunosuppression and family planning. Conclusion This is the first study to demonstrate the capability of an AI-based system to provide accurate and comprehensive answers to real-world patient queries in IBD. AI systems may serve as a useful adjunct for patients, in addition to standard of care in clinics and validated patient information resources. However, responses in specialist areas may deviate from evidence-based guidance and the replies need to give more firm advice.

Publisher

Oxford University Press (OUP)

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3