Feasibility of GPT-3 and GPT-4 for in-Depth Patient Education Prior to Interventional Radiological Procedures: A Comparative Analysis-Reference-Cited by-同舟云学术

Feasibility of GPT-3 and GPT-4 for in-Depth Patient Education Prior to Interventional Radiological Procedures: A Comparative Analysis

Published:2023-10-23 Issue: Volume: Page:
ISSN:0174-1551
Container-title:CardioVascular and Interventional Radiology
language:en
Short-container-title:Cardiovasc Intervent Radiol

Author:

Scheschenja Michael^ORCID,Viniol Simon,Bastian Moritz B.,Wessendorf Joel,König Alexander M.,Mahnken Andreas H.

Abstract

Abstract Purpose This study explores the utility of the large language models, GPT-3 and GPT-4, for in-depth patient education prior to interventional radiology procedures. Further, differences in answer accuracy between the models were assessed. Materials and methods A total of 133 questions related to three specific interventional radiology procedures (Port implantation, PTA and TACE) covering general information as well as preparation details, risks and complications and post procedural aftercare were compiled. Responses of GPT-3 and GPT-4 were assessed for their accuracy by two board-certified radiologists using a 5-point Likert scale. The performance difference between GPT-3 and GPT-4 was analyzed. Results Both GPT-3 and GPT-4 responded with (5) “completely correct” (4) “very good” answers for the majority of questions ((5) 30.8% + (4) 48.1% for GPT-3 and (5) 35.3% + (4) 47.4% for GPT-4). GPT-3 and GPT-4 provided (3) “acceptable” responses 15.8% and 15.0% of the time, respectively. GPT-3 provided (2) “mostly incorrect” responses in 5.3% of instances, while GPT-4 had a lower rate of such occurrences, at just 2.3%. No response was identified as potentially harmful. GPT-4 was found to give significantly more accurate responses than GPT-3 (p = 0.043). Conclusion GPT-3 and GPT-4 emerge as relatively safe and accurate tools for patient education in interventional radiology. GPT-4 showed a slightly better performance. The feasibility and accuracy of these models suggest their promising role in revolutionizing patient care. Still, users need to be aware of possible limitations. Graphical Abstract

Funder

Philipps-Universität Marburg

Publisher

Springer Science and Business Media LLC

Subject

Cardiology and Cardiovascular Medicine,Radiology, Nuclear Medicine and imaging

Link

https://link.springer.com/content/pdf/10.1007/s00270-023-03563-2.pdf

Reference14 articles.

1. Koski E, Murphy J. AI in healthcare. Stud Health Technol Inform. 2021;284:295–9. https://doi.org/10.3233/SHTI210726.

2. Lecler A, Duron L, Soyer P. Revolutionizing radiology with GPT-based models: current applications, future possibilities and limitations of ChatGPT. Diagn Interv Imaging. 2023;104(6):269–74. https://doi.org/10.1016/j.diii.2023.02.003.

3. O’Connor S. Open artificial intelligence platforms in nursing education: tools for academic progress or abuse? Nurse Educ Pract. 2023;66:103537. https://doi.org/10.1016/j.nepr.2022.103537.