Abstract
Background
ChatGPT is an artificial intelligence language model developed and released by OpenAI (San Francisco, CA) in late 2022.
Objectives
The aim of this study was to evaluate the performance of ChatGPT on the Plastic Surgery In-Service Examination and to compare it to residents’ performance nationally.
Methods
The Plastic Surgery In-Service Examinations from 2018 to 2022 were used as a question source. For each question, the stem and all multiple-choice options were imported into ChatGPT. The 2022 examination was used to compare the performance of ChatGPT to plastic surgery residents nationally.
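For illustration only, a minimal sketch of how this item-by-item querying could be scripted is shown below. The study describes entering questions into the ChatGPT interface directly, so the OpenAI API client, model name, prompt wording, and example item here are assumptions for the sketch, not the authors' workflow.

```python
# Hypothetical sketch: posing one multiple-choice exam item to the model
# programmatically. The study imported stems and options into ChatGPT
# manually; the model name, prompt format, and example item below are
# illustrative assumptions, not the authors' method.
from openai import OpenAI

client = OpenAI()  # assumes OPENAI_API_KEY is set in the environment


def ask_question(stem: str, options: dict[str, str]) -> str:
    """Send one question stem plus its lettered options and return
    the model's raw answer text."""
    option_text = "\n".join(f"{letter}. {text}" for letter, text in options.items())
    prompt = f"{stem}\n{option_text}\nAnswer with the single best option letter."
    response = client.chat.completions.create(
        model="gpt-3.5-turbo",  # hypothetical stand-in for the ChatGPT model used
        messages=[{"role": "user", "content": prompt}],
    )
    return response.choices[0].message.content.strip()


# Example item (invented for illustration, not from the examination)
answer = ask_question(
    "Which flap is based on the descending branch of the lateral circumflex femoral artery?",
    {
        "A": "Anterolateral thigh flap",
        "B": "Gracilis flap",
        "C": "Radial forearm flap",
        "D": "Latissimus dorsi flap",
    },
)
print(answer)
```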
Results
In total, 1129 questions were included in the final analysis, of which ChatGPT answered 630 (55.8%) correctly. ChatGPT scored highest on the 2021 exam (60.1%) and on the comprehensive section (58.7%). There were no significant differences in the proportion of questions answered correctly across exam years or across exam sections. On the 2022 exam, ChatGPT answered 57% of questions correctly. Compared with the national performance of plastic surgery residents in 2022, ChatGPT would rank in the 49th percentile for first-year integrated plastic surgery residents, the 13th percentile for second-year residents, the 5th percentile for third- and fourth-year residents, and the 0th percentile for fifth- and sixth-year residents.
Conclusions
ChatGPT performs at the level of a first-year resident on the Plastic Surgery In-Service Examination. However, it performed poorly compared with residents in more advanced years of training. Although ChatGPT has undeniable benefits and potential uses in healthcare and medical education, additional research is needed to assess its efficacy.
Publisher
Oxford University Press (OUP)
Cited by
78 articles.