The Scientific Knowledge of Bard and ChatGPT in Endocrinology, Diabetes, and Diabetes Technology: Multiple-Choice Questions Examination-Based Performance

Published: 2023-10-05
ISSN:1932-2968
Container-title:Journal of Diabetes Science and Technology
language:en
Short-container-title:J Diabetes Sci Technol

Author:

Meo Sultan Ayoub¹, Al-Khlaiwi Thamir¹, AbuKhalaf Abdulelah Adnan², Meo Anusha Sultan³, Klonoff David C.⁴

Affiliation:

1. Department of Physiology, College of Medicine, King Saud University, Riyadh, Saudi Arabia

2. College of Medicine, King Saud University, Riyadh, Saudi Arabia

3. The School of Medicine, Medical Sciences and Nutrition, University of Aberdeen, Aberdeen, UK

4. Diabetes Research Institute, Mills-Peninsula Medical Center, San Mateo, CA, USA

Abstract

Background: The present study aimed to investigate the knowledge level of Bard and ChatGPT in the areas of endocrinology, diabetes, and diabetes technology through a multiple-choice question (MCQ) examination format.

Methods: Initially, a 100-MCQ bank was established covering endocrinology, diabetes, and diabetes technology. The MCQs were drawn from physiology and medical textbooks and from academic examination pools in these areas. The study team members reviewed the MCQ contents to ensure that they were relevant to endocrinology, diabetes, and diabetes technology. Fifty MCQs came from endocrinology and 50 from diabetes and diabetes technology. The knowledge level of Google's Bard and ChatGPT was assessed with an MCQ-based examination.

Results: In the endocrinology examination section, ChatGPT obtained 29 marks (correct responses) of 50 (58%), and Bard obtained the same score of 29 of 50 (58%). In the diabetes technology examination section, however, ChatGPT obtained 23 marks of 50 (46%), and Bard obtained 20 marks of 50 (40%). Overall, across the entire examination, ChatGPT obtained 52 marks of 100 (52%), and Bard obtained 49 marks of 100 (49%). ChatGPT thus scored slightly higher than Bard, but neither tool reached the satisfactory score of at least 60% in endocrinology or in diabetes/diabetes technology.

Conclusions: The overall MCQ-based performance of ChatGPT was slightly better than that of Google's Bard. However, neither ChatGPT nor Bard achieved an adequate score in endocrinology or in diabetes/diabetes technology. The study indicates that Bard and ChatGPT have the potential to assist medical students and faculty in academic medical education settings, but both artificial intelligence tools need more up-to-date information in the fields of endocrinology, diabetes, and diabetes technology.
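The totals and percentages reported in the Results can be re-derived from the per-section marks. The following is a minimal sketch of that arithmetic, using only the figures given in the abstract; the names `scores`, `summarize`, `SECTION_SIZE`, and `PASS_THRESHOLD` are illustrative and do not come from the paper, and the 60% level is applied here to the overall score as an assumption about how the threshold would be checked.

```python
# Marks (correct responses) per section as reported in the abstract.
# Each section contained 50 MCQs.
SECTION_SIZE = 50
# The abstract treats "at least 60%" as a satisfactory score.
PASS_THRESHOLD = 0.60

scores = {
    "ChatGPT": {"endocrinology": 29, "diabetes_technology": 23},
    "Bard": {"endocrinology": 29, "diabetes_technology": 20},
}


def summarize(section_scores):
    """Return (per-section fractions, overall fraction, overall >= threshold)."""
    fractions = {
        name: marks / SECTION_SIZE for name, marks in section_scores.items()
    }
    total_marks = sum(section_scores.values())
    total_questions = SECTION_SIZE * len(section_scores)
    overall = total_marks / total_questions
    return fractions, overall, overall >= PASS_THRESHOLD


for model, section_scores in scores.items():
    fractions, overall, passed = summarize(section_scores)
    per_section = ", ".join(f"{name}: {frac:.0%}" for name, frac in fractions.items())
    status = "meets" if passed else "below"
    print(f"{model}: {per_section}; overall {overall:.0%} ({status} 60% threshold)")
```

Under these inputs, both tools fall below the 60% level in each section and overall, which matches the abstract's conclusion.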

Funder

Deputyship for Research and Innovation, Ministry of Education, Saudi Arabia

Publisher

SAGE Publications

Subject

Biomedical Engineering; Bioengineering; Endocrinology, Diabetes and Metabolism; Internal Medicine

Link

http://journals.sagepub.com/doi/pdf/10.1177/19322968231203987

References: 21 articles.

1. Aydın Ö. Google Bard generated literature review: metaverse. 2023. https://papers.ssrn.com/abstract=4454615.

Cited by 19 articles.

1. Evaluating the accuracy and adequacy of ChatGPT in responding to queries of diabetes patients in primary healthcare;International Journal of Diabetes in Developing Countries;2024-09-11

2. Assessment Study of ChatGPT-3.5’s Performance on the Final Polish Medical Examination: Accuracy in Answering 980 Questions;Healthcare;2024-08-16

3. How good is ChatGPT at answering patients’ questions related to early detection of oral (mouth) cancer?;Oral Surgery, Oral Medicine, Oral Pathology and Oral Radiology;2024-08

4. ChatGPT-3.5 Versus Google Bard: Which Large Language Model Responds Best to Commonly Asked Pregnancy Questions?;Cureus;2024-07-27

5. Is ChatGPT reliable and accurate in answering pharmacotherapy-related inquiries in both Turkish and English?;Currents in Pharmacy Teaching and Learning;2024-07