ChatGPT in Occupational Medicine: A Comparative Study with Human Experts-Reference-Cited by-同舟云学术

ChatGPT in Occupational Medicine: A Comparative Study with Human Experts

Published:2024-01-06 Issue:1 Volume:11 Page:57
ISSN:2306-5354
Container-title:Bioengineering
language:en
Short-container-title:Bioengineering

Author:

Padovan Martina¹^ORCID,Cosci Bianca¹,Petillo Armando¹,Nerli Gianluca¹,Porciatti Francesco¹,Scarinci Sergio¹,Carlucci Francesco¹,Dell’Amico Letizia¹,Meliani Niccolò¹,Necciari Gabriele¹,Lucisano Vincenzo Carmelo¹,Marino Riccardo¹,Foddis Rudy¹^ORCID,Palla Alessandro²^ORCID

Affiliation:

1. Department of Translational Research and New Technologies in Medicine and Surgery, University of Pisa, 56126 Pisa, Italy

2. Intel Corporation, Santa Clara, CA 95054, USA

Abstract

The objective of this study is to evaluate ChatGPT’s accuracy and reliability in answering complex medical questions related to occupational health and explore the implications and limitations of AI in occupational health medicine. The study also provides recommendations for future research in this area and informs decision-makers about AI’s impact on healthcare. A group of physicians was enlisted to create a dataset of questions and answers on Italian occupational medicine legislation. The physicians were divided into two teams, and each team member was assigned a different subject area. ChatGPT was used to generate answers for each question, with/without legislative context. The two teams then evaluated human and AI-generated answers blind, with each group reviewing the other group’s work. Occupational physicians outperformed ChatGPT in generating accurate questions on a 5-point Likert score, while the answers provided by ChatGPT with access to legislative texts were comparable to those of professional doctors. Still, we found that users tend to prefer answers generated by humans, indicating that while ChatGPT is useful, users still value the opinions of occupational medicine professionals.

Publisher

MDPI AG

Link

https://www.mdpi.com/2306-5354/11/1/57/pdf

Reference46 articles.

1. Artificial intelligence powers digital medicine;Fogel;NPJ Digit. Med.,2018

2. Artificial Intelligence in Radiology: Overview of Application Types, Design, and Challenges;Moassefi;Semin. Roentgenol.,2023

3. Deep Neural Networks Can Predict New-Onset Atrial Fibrillation From the 12-Lead ECG and Help Identify Those at Risk of Atrial Fibrillation-Related Stroke;Raghunath;Circulation,2021

4. Integrated Machine Learning and Bioinformatic Analyses Constructed a Novel Stemness-Related Classifier to Predict Prognosis and Immunotherapy Responses for Hepatocellular Carcinoma Patients;Chen;Int. J. Biol. Sci.,2022

5. Srinivasu, P.N., SivaSai, J.G., Ijaz, M.F., Bhoi, A.K., Kim, W., and Kang, J.J. (2021). Classification of Skin Disease Using Deep Learning Neural Networks with MobileNet V2 and LSTM. Sensors, 21.

Cited by 6 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Artificial Intelligence and ChatGPT Models in Healthcare;Advances in Logistics, Operations, and Management Science;2024-08-23

2. Toward Clinical Generative AI: Conceptual Framework;JMIR AI;2024-06-07

3. Exploring the competence of ChatGPT for customer and patient service management;Intelligent Pharmacy;2024-06

4. Art or Artifact: Evaluating the Accuracy, Appeal, and Educational Value of AI-Generated Imagery in DALL·E 3 for Illustrating Congenital Heart Diseases;Journal of Medical Systems;2024-05-23

5. Art or Artifact: Evaluating the Accuracy, Appeal, and Educational Value of AI-Generated Imagery in DALL·E 3 for Illustrating Congenital Heart Diseases;2024-01-26