Authors:
Wang Xinyi, Gong Zhenye, Wang Guoxin, Jia Jingdan, Xu Ying, Zhao Jialu, Fan Qingye, Wu Shaun, Hu Weiguo, Li Xiaoyang
Abstract
INTRODUCTION
ChatGPT, a language model developed by OpenAI, uses a 175-billion-parameter Transformer architecture for natural language processing tasks. This study aimed to compare the knowledge and interpretation ability of ChatGPT with those of medical students in China by administering the Chinese National Medical Licensing Examination (NMLE) to both ChatGPT and medical students.
METHODS
We evaluated the performance of ChatGPT on two years' worth of the NMLE, which consists of four units. ChatGPT's exam results were then compared with those of medical students who had completed five years of study at medical colleges.
RESULTS
ChatGPT’s performance was lower than that of the medical students, and ChatGPT’s correct answer rate was related to the year in which the exam questions were released.
CONCLUSION
ChatGPT’s knowledge and interpretation ability for the NMLE were not yet comparable to those of medical students in China. It is probable that these abilities will improve through deep learning.
Publisher
Research Square Platform LLC
Cited by
24 articles.