Challenging ChatGPT 3.5 in Senology—An Assessment of Concordance with Breast Cancer Tumor Board Decision Making

Published:2023-10-16 Issue:10 Volume:13 Page:1502
ISSN:2075-4426
Container-title:Journal of Personalized Medicine
language:en
Short-container-title:JPM

Author:

Griewing Sebastian¹²³, Gremke Niklas², Wagner Uwe², Lingenfelder Michael³, Kuhn Sebastian¹, Boekhoff Jelena²

Affiliation:

1. Institute for Digital Medicine, University Hospital Marburg, Philipps-University Marburg, Baldingerstraße, 35043 Marburg, Germany

2. Department of Gynecology and Obstetrics, University Hospital Marburg, Philipps-University Marburg, Baldingerstraße, 35043 Marburg, Germany

3. Institute for Healthcare Management, Chair of General Business Administration, Philipps-University Marburg, Universitätsstraße 24, 35037 Marburg, Germany

Abstract

With the recent spread of publicly available large language models (LLMs), general interest in generative artificial-intelligence-based applications for medical purposes has skyrocketed. The increased use of these models by tech-savvy patients for personal health issues calls for a scientific evaluation of whether LLMs are accurate enough to inform treatment decisions. This observational study assesses the concordance between treatment recommendations from the popular LLM ChatGPT 3.5 and those of a multidisciplinary tumor board for breast cancer (MTB). The study design builds on previous findings by combining an extended input model with patient profiles that reflect the patho- and immunomorphological diversity of primary breast cancer, including primary metastasis and precancerous tumor stages. Across all patient profiles, including precancerous lesions, the LLM and the MTB agree in half of the cases. For invasive breast cancer profiles, concordance is 58.8%. Nevertheless, because the LLM at times makes seriously erroneous decisions, we do not consider the current development status of publicly available LLMs adequate for use as a support tool for tumor boards. Gynecological oncologists should familiarize themselves with the capabilities of LLMs in order to understand and use their potential while keeping the risks and limitations in mind.

Funder

Deutsche Forschungsgemeinschaft

Clinician Scientist program (SUCCESS-program) of Philipps-University of Marburg and the University Hospital of Giessen and Marburg

Publisher

MDPI AG

Subject

Medicine (miscellaneous)

Link

https://www.mdpi.com/2075-4426/13/10/1502/pdf

References: 31 articles.

1. Quazi (2022). Artificial intelligence and machine learning in precision and genomic medicine. Med. Oncol.

2. Ghaderzadeh, M., Aria, M., and Asadi, F. (2021). X-Ray equipped with artificial intelligence: Changing the COVID-19 diagnostic paradigm during the pandemic. Biomed. Res. Int., 2021.