Author:
Kung Tiffany H., Cheatham Morgan, Medenilla Arielle, Sillos Czarina, De Leon Lorie, Elepaño Camille, Madriaga Maria, Aggabao Rimel, Diaz-Candido Giezel, Maningo James, Tseng Victor
Abstract
We evaluated the performance of a large language model called ChatGPT on the United States Medical Licensing Exam (USMLE), which consists of three exams: Step 1, Step 2CK, and Step 3. ChatGPT performed at or near the passing threshold for all three exams without any specialized training or reinforcement. Additionally, ChatGPT demonstrated a high level of concordance and insight in its explanations. These results suggest that large language models may have the potential to assist with medical education, and potentially, clinical decision-making.
Publisher
Cold Spring Harbor Laboratory
Cited by
96 articles.