Comparison of ChatGPT, Gemini, and Le Chat with physician interpretations of medical laboratory questions from an online health forum-Reference-Cited by-同舟云学术

Comparison of ChatGPT, Gemini, and Le Chat with physician interpretations of medical laboratory questions from an online health forum

Published:2024-05-29 Issue: Volume: Page:
ISSN:1434-6621
Container-title:Clinical Chemistry and Laboratory Medicine (CCLM)
language:en
Short-container-title:

Author:

Meyer Annika¹^ORCID,Soleman Ari²,Riese Janik³,Streichert Thomas¹^ORCID

Affiliation:

1. Institute of Clinical Chemistry, Faculty of Medicine and University Hospital , 27182 University Hospital Cologne , Cologne , Germany

2. Faculty of Medicine and University Hospital , 27182 University Hospital Cologne , Cologne , Germany

3. Institute of Pathology, Faculty of Medicine , RWTH Aachen University , Aachen , Germany

Abstract

Abstract Objectives Laboratory medical reports are often not intuitively comprehensible to non-medical professionals. Given their recent advancements, easier accessibility and remarkable performance on medical licensing exams, patients are therefore likely to turn to artificial intelligence-based chatbots to understand their laboratory results. However, empirical studies assessing the efficacy of these chatbots in responding to real-life patient queries regarding laboratory medicine are scarce. Methods Thus, this investigation included 100 patient inquiries from an online health forum, specifically addressing Complete Blood Count interpretation. The aim was to evaluate the proficiency of three artificial intelligence-based chatbots (ChatGPT, Gemini and Le Chat) against the online responses from certified physicians. Results The findings revealed that the chatbots’ interpretations of laboratory results were inferior to those from online medical professionals. While the chatbots exhibited a higher degree of empathetic communication, they frequently produced erroneous or overly generalized responses to complex patient questions. The appropriateness of chatbot responses ranged from 51 to 64 %, with 22 to 33 % of responses overestimating patient conditions. A notable positive aspect was the chatbots’ consistent inclusion of disclaimers regarding its non-medical nature and recommendations to seek professional medical advice. Conclusions The chatbots’ interpretations of laboratory results from real patient queries highlight a dangerous dichotomy – a perceived trustworthiness potentially obscuring factual inaccuracies. Given the growing inclination towards self-diagnosis using AI platforms, further research and improvement of these chatbots is imperative to increase patients’ awareness and avoid future burdens on the healthcare system.

Publisher

Walter de Gruyter GmbH

Link

https://www.degruyter.com/document/doi/10.1515/cclm-2024-0246/pdf

Reference50 articles.

1. Cadamuro, J, Cabitza, F, Debeljak, Z, Bruyne, SD, Frans, G, Perez, SM, et al.. Potentials and pitfalls of ChatGPT and natural-language artificial intelligence models for the understanding of laboratory medicine test results. An assessment by the European federation of clinical chemistry and laboratory medicine (EFLM) working group on artificial intelligence (WG-AI). Clin Chem Lab Med 2023;61:1158–66. https://doi.org/10.1515/cclm-2023-0355.

2. Nov, O, Singh, N, Mann, D. Putting ChatGPT’s medical advice to the (turing) test: survey study. JMIR Med Educ 2023;9:e46939. https://doi.org/10.2196/46939.

3. Liebrenz, M, Schleifer, R, Buadze, A, Bhugra, D, Smith, A. Generating scholarly content with ChatGPT: ethical challenges for medical publishing. Lancet Digit Health 2023;5:e105–6. https://doi.org/10.1016/s2589-7500(23)00019-5.

4. Hu, K. ChatGPT sets record for fastest-growing user base – analyst note; 2023. https://www.reuters.com/technology/chatgpt-sets-record-fastest-growing-user-base-analyst-note-2023-02-01/ [Accessed 28 Dec 2023].

5. Shahsavar, Y, Choudhury, A. User intentions to use ChatGPT for self-diagnosis and health-related purposes: cross-sectional survey study. JMIR Hum Factors 2023;10:e47564. https://doi.org/10.2196/47564.

Cited by 1 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Generative artificial intelligence (AI) for reporting the performance of laboratory biomarkers: not ready for prime time;Clinical Chemistry and Laboratory Medicine (CCLM);2024-07-31