Performance of ChatGPT on USMLE: Potential for AI-assisted medical education using large language models-Reference-Cited by-同舟云学术

Performance of ChatGPT on USMLE: Potential for AI-assisted medical education using large language models

Published:2023-02-09 Issue:2 Volume:2 Page:e0000198
ISSN:2767-3170
Container-title:PLOS Digital Health
language:en
Short-container-title:PLOS Digit Health

Author:

Kung Tiffany H.,Cheatham Morgan,Medenilla Arielle,Sillos Czarina,De Leon Lorie,Elepaño Camille,Madriaga Maria,Aggabao Rimel,Diaz-Candido Giezel,Maningo James,Tseng Victor^ORCID

Abstract

We evaluated the performance of a large language model called ChatGPT on the United States Medical Licensing Exam (USMLE), which consists of three exams: Step 1, Step 2CK, and Step 3. ChatGPT performed at or near the passing threshold for all three exams without any specialized training or reinforcement. Additionally, ChatGPT demonstrated a high level of concordance and insight in its explanations. These results suggest that large language models may have the potential to assist with medical education, and potentially, clinical decision-making.

Publisher

Public Library of Science (PLoS)

Reference25 articles.

1. Reproducibility in machine learning for health research: Still a ways to go.;MBA McDermott;Sci Transl Med.,2021

2. How to develop machine learning models for healthcare.;P-HC Chen;Nat Mater.,2019

Cited by 1637 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Beyond the Scalpel: Assessing ChatGPT's potential as an auxiliary intelligent virtual assistant in oral surgery;Computational and Structural Biotechnology Journal;2024-12

2. Evaluating the interactions of Medical Doctors with chatbots based on large language models: Insights from a nationwide study in the Greek healthcare sector using ChatGPT;Computers in Human Behavior;2024-12

3. ChatGPT performance on radiation technologist and therapist entry to practice exams;Journal of Medical Imaging and Radiation Sciences;2024-12

4. Can ChatGPT make surgical decisions with confidence similar to experienced knee surgeons?;The Knee;2024-12

5. Accuracy assessment of ChatGPT responses to frequently asked questions regarding anterior cruciate ligament surgery;The Knee;2024-12