Evaluating the Quality and Usability of Artificial Intelligence–Generated Responses to Common Patient Questions in Foot and Ankle Surgery-Reference-Cited by-同舟云学术

Evaluating the Quality and Usability of Artificial Intelligence–Generated Responses to Common Patient Questions in Foot and Ankle Surgery

Published:2023-10 Issue:4 Volume:8 Page:
ISSN:2473-0114
Container-title:Foot & Ankle Orthopaedics
language:en
Short-container-title:Foot & Ankle Orthopaedics

Author:

Anastasio Albert Thomas¹^ORCID,Mills Frederic Baker¹,Karavan Mark P.¹^ORCID,Adams Samuel B.¹

Affiliation:

1. Department of Orthopaedic Surgery, Duke University Medical Center, Durham, NC, USA

Abstract

Background: Artificial intelligence (AI) platforms, such as ChatGPT, have become increasingly popular outlets for the consumption and distribution of health care–related advice. Because of a lack of regulation and oversight, the reliability of health care–related responses has become a topic of controversy in the medical community. To date, no study has explored the quality of AI-derived information as it relates to common foot and ankle pathologies. This study aims to assess the quality and educational benefit of ChatGPT responses to common foot and ankle–related questions. Methods: ChatGPT was asked a series of 5 questions, including “What is the optimal treatment for ankle arthritis?” “How should I decide on ankle arthroplasty versus ankle arthrodesis?” “Do I need surgery for Jones fracture?” “How can I prevent Charcot arthropathy?” and “Do I need to see a doctor for my ankle sprain?” Five responses (1 per each question) were included after applying the exclusion criteria. The content was graded using DISCERN (a well-validated informational analysis tool) and AIRM (a self-designed tool for exercise evaluation). Results: Health care professionals graded the ChatGPT-generated responses as bottom tier 4.5% of the time, middle tier 27.3% of the time, and top tier 68.2% of the time. Conclusion: Although ChatGPT and other related AI platforms have become a popular means for medical information distribution, the educational value of the AI-generated responses related to foot and ankle pathologies was variable. With 4.5% of responses receiving a bottom-tier rating, 27.3% of responses receiving a middle-tier rating, and 68.2% of responses receiving a top-tier rating, health care professionals should be aware of the high viewership of variable-quality content easily accessible on ChatGPT. Level of Evidence: Level III, cross sectional study.

Publisher

SAGE Publications

Subject

Orthopedics and Sports Medicine

Link

http://journals.sagepub.com/doi/pdf/10.1177/24730114231209919

Reference25 articles.

1. Deep Learning Algorithms Improve the Detection of Subtle Lisfranc Malalignments on Weightbearing Radiographs

2. Detection of ankle fractures using deep learning algorithms

3. Predictive Behavior of a Computational Foot/Ankle Model through Artificial Neural Networks

4. The Role of ChatGPT, Generative Language Models, and Artificial Intelligence in Medical Education: A Conversation With ChatGPT and a Call for Papers

5. Sanders classification of calcaneal fractures in CT images with deep learning and differential data augmentation techniques

Cited by 7 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Evaluating the interactions of Medical Doctors with chatbots based on large language models: Insights from a nationwide study in the Greek healthcare sector using ChatGPT;Computers in Human Behavior;2024-12

2. Currently Available Large Language Models Do Not Provide Musculoskeletal Treatment Recommendations That Are Concordant With Evidence-Based Clinical Practice Guidelines;Arthroscopy: The Journal of Arthroscopic & Related Surgery;2024-08

3. Editorial Commentary: At Present, ChatGPT Cannot Be Relied Upon to Answer Patient Questions and Requires Physician Expertise to Interpret Answers for Patients;Arthroscopy: The Journal of Arthroscopic & Related Surgery;2024-07

4. ChatGPT-4 Knows Its A B C D E but Cannot Cite Its Source;JBJS Open Access;2024-07

5. A Use Case for Generative AI in Medical Education;JMIR Medical Education;2024-06-07