Abstract
Aim: To examine the clinical accuracy and applicability of ChatGPT answers to commonly asked questions from patients considering posterior lumbar decompression (PLD).
Methods: A literature review was conducted to identify 10 questions encompassing some of the most common questions and concerns patients may have regarding lumbar decompression surgery. The selected questions were posed to ChatGPT, and the initial responses were recorded; no follow-up or clarifying questions were permitted. Two attending fellowship-trained spine surgeons then graded each response from the chatbot using a modified Global Quality Scale to evaluate ChatGPT’s accuracy and utility. The surgeons then analyzed each question, providing evidence-based justifications for the scores.
Results: The minimum possible total score across all ten questions was 20, and the maximum was 100. ChatGPT’s responses in this analysis earned a total score of 59, corresponding to an average of just under 3 per question, when evaluated by the two attending spine surgeons. A score of 3 denoted a somewhat useful response of moderate quality, with some important information adequately discussed and some poorly discussed.
Conclusion: ChatGPT can provide broadly useful responses to common preoperative questions from patients considering PLD. It has excellent utility in providing background information and in helping patients become more informed about their pathology in general. However, it often lacks the patient-specific context necessary to offer personalized, accurate insights into prognosis and treatment options.