Potentials and pitfalls of ChatGPT and natural-language artificial intelligence models for the understanding of laboratory medicine test results. An assessment by the European Federation of Clinical Chemistry and Laboratory Medicine (EFLM) Working Group on Artificial Intelligence (WG-AI)

Published: 2023-04-24 Issue: 7 Volume: 61 Page: 1158-1166
ISSN: 1434-6621
Container-title: Clinical Chemistry and Laboratory Medicine (CCLM)
Language: en

Author:

Cadamuro Janne¹, Cabitza Federico²³, Debeljak Zeljko⁴⁵, De Bruyne Sander⁶, Frans Glynis⁷, Perez Salomon Martin⁸, Ozdemir Habib⁹, Tolios Alexander¹⁰, Carobene Anna¹¹, Padoan Andrea¹²

Affiliation:

1. Department of Laboratory Medicine, Paracelsus Medical University Salzburg, Salzburg, Austria

2. DISCo, Università degli Studi di Milano-Bicocca, Milano, Italy

3. IRCCS Istituto Ortopedico Galeazzi, Milan, Italy

4. Faculty of Medicine, Josip Juraj Strossmayer University of Osijek, Osijek, Croatia

5. Clinical Institute of Laboratory Diagnostics, University Hospital Center Osijek, Osijek, Croatia

6. Department of Laboratory Medicine, Ghent University Hospital, Ghent, Belgium

7. Department of Laboratory Medicine, University Hospitals Leuven, KU Leuven, Leuven, Belgium

8. Unidad de Bioquímica Clínica, Hospital Universitario Virgen Macarena, Sevilla, Spain

9. Department of Medical Biochemistry, Faculty of Medicine, Manisa Celal Bayar University, Manisa, Türkiye

10. Department of Transfusion Medicine and Cell Therapy, Medical University of Vienna, Vienna, Austria

11. IRCCS San Raffaele Scientific Institute, Milan, Italy

12. Department of Medicine (DIMED), University of Padova, Padova, Italy

Abstract

Objectives: ChatGPT, a tool based on natural language processing (NLP), is on everyone’s mind, and several potential applications in healthcare have already been proposed. However, since the ability of this tool to interpret laboratory test results has not yet been tested, the EFLM Working Group on Artificial Intelligence (WG-AI) has set itself the task of closing this gap with a systematic approach.

Methods: WG-AI members generated 10 simulated laboratory reports of common parameters, which were then passed to ChatGPT for interpretation, according to reference intervals (RI) and units, using an optimized prompt. The results were subsequently evaluated independently by all WG-AI members with respect to relevance, correctness, helpfulness and safety.

Results: ChatGPT recognized all laboratory tests, could detect whether they deviated from the RI, and gave a test-by-test as well as an overall interpretation. The interpretations were rather superficial, not always correct, and, only in some cases, judged coherently. The magnitude of the deviation from the RI seldom played a role in the interpretation of laboratory tests, and the artificial intelligence (AI) did not make any meaningful suggestions regarding follow-up diagnostics or further procedures in general.

Conclusions: ChatGPT in its current form, not being specifically trained on medical data or laboratory data in particular, may at best be considered a tool capable of interpreting a laboratory report on a test-by-test basis, but not of interpreting an overall diagnostic picture. Future generations of similar AIs trained on medical ground truth data might well revolutionize current processes in healthcare, although such an implementation is not yet ready.

Publisher

Walter de Gruyter GmbH

Subject

Biochemistry (medical),Clinical Biochemistry,General Medicine

Link

https://www.degruyter.com/document/doi/10.1515/cclm-2023-0355/pdf

Cited by 57 articles.

1. Beyond the Scalpel: Assessing ChatGPT's potential as an auxiliary intelligent virtual assistant in oral surgery;Computational and Structural Biotechnology Journal;2024-12

2. A Comparative Analysis of Large language Models on Clinical Questions for Autoimmune Diseases;2024-08-27

3. Enhancing Patient Understanding of Laboratory Test Results: Systematic Review of Presentation Formats and Their Impact on Perception, Decision, Action, and Memory;Journal of Medical Internet Research;2024-08-12

4. ChatGPT in medicine: A cross-disciplinary systematic review of ChatGPT’s (artificial intelligence) role in research, clinical practice, education, and patient interaction;Medicine;2024-08-09

5. Generative artificial intelligence (AI) for reporting the performance of laboratory biomarkers: not ready for prime time;Clinical Chemistry and Laboratory Medicine (CCLM);2024-07-31