Abstract
Objective
Large Language Models (LLMs) have demonstrated proficiency in free-text analysis in healthcare. With recent advancements, GPT-4 can now analyze both text and accompanying images. The aim of this study was to evaluate the performance of the multimodal GPT-4 in analyzing medical images, using USMLE questions that incorporate visuals.

Methods
We analyzed GPT-4's performance on 55 USMLE sample questions across the three exam steps. In separate chat instances, we provided the model with each question both with and without its images, and calculated accuracy under each condition.

Results
GPT-4 achieved an accuracy of 80.0% with images and 65.0% without. There were no cases in which the model answered correctly without the images but incorrectly with them. Performance varied across USMLE steps and was significantly better on questions with figures than on questions with graphs.

Conclusion
GPT-4 demonstrated an ability to analyze medical images from USMLE questions, including graphs and figures. A multimodal LLM in healthcare could potentially accelerate both patient care and research by integrating visual data and text into analysis processes.
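As a rough illustration of the evaluation protocol described in the Methods, the sketch below shows how one might pose each question to a multimodal model in a fresh chat, with and without its image, and score accuracy. It assumes the OpenAI Python SDK; the model name, data layout, and string-match scoring are illustrative assumptions, not the authors' actual pipeline.

```python
# Minimal sketch of the with/without-image comparison, assuming the
# OpenAI Python SDK (v1.x). Model name, question format, and scoring
# are assumptions for illustration, not the study's actual code.
import base64
from openai import OpenAI

client = OpenAI()  # reads OPENAI_API_KEY from the environment

def ask(question_text: str, image_path: str | None = None) -> str:
    """Send one USMLE-style question in a fresh chat; optionally attach its image."""
    content = [{"type": "text", "text": question_text}]
    if image_path is not None:
        with open(image_path, "rb") as f:
            b64 = base64.b64encode(f.read()).decode()
        content.append({
            "type": "image_url",
            "image_url": {"url": f"data:image/png;base64,{b64}"},
        })
    response = client.chat.completions.create(
        model="gpt-4o",  # placeholder; the study used a multimodal GPT-4 variant
        messages=[{"role": "user", "content": content}],
    )
    return response.choices[0].message.content

def accuracy(questions, with_images: bool) -> float:
    """Fraction answered correctly; each call is a separate chat instance."""
    correct = 0
    for q in questions:  # assumed format: dict with "text", "image", "answer"
        reply = ask(q["text"], q["image"] if with_images else None)
        correct += q["answer"] in reply  # naive string match for illustration
    return correct / len(questions)
```

Running `accuracy(questions, with_images=True)` and `accuracy(questions, with_images=False)` over the 55 sample questions would yield the two accuracy figures compared in the Results.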
Publisher
Cold Spring Harbor Laboratory