Evaluating the Accuracy of ChatGPT in Common Patient Questions Regarding HPV+ Oropharyngeal Carcinoma

Authors:

Nikhil Bellamkonda¹, Janice L. Farlow², Catherine T. Haring³, Michael W. Sim², Nolan B. Seim³, Richard B. Cannon¹, Marcus M. Monroe¹, Amit Agrawal³, James W. Rocco³, Hilary C. McCrary¹

Affiliation:

1. Department of Otolaryngology—Head and Neck Surgery, University of Utah, Salt Lake City, UT, USA

2. Department of Otolaryngology—Head and Neck Surgery, Indiana University, Indianapolis, IN, USA

3. Department of Otolaryngology-Head and Neck Surgery, The Ohio State University Wexner Medical Center, Columbus, OH, USA

Abstract

Objectives: Large language model (LLM)-based chatbots such as ChatGPT have been publicly available and increasingly used by the general public since late 2022. This study sought to investigate ChatGPT responses to common patient questions regarding human papillomavirus (HPV)-positive oropharyngeal cancer (OPC).

Methods: This was a prospective, multi-institutional study, with data collected from high-volume institutions that perform >50 transoral robotic surgery cases per year. The 100 most recent discussion threads including the term “HPV” on the American Cancer Society’s Cancer Survivors Network Head and Neck Cancer public discussion board were reviewed. The 11 most common questions were serially queried to ChatGPT 3.5, and the answers were recorded. A survey was distributed to fellowship-trained head and neck oncologic surgeons at 3 institutions to evaluate the responses.

Results: A total of 8 surgeons participated in the study. For questions regarding HPV contraction and transmission, ChatGPT answers were scored as clinically accurate and aligned with consensus in the head and neck surgical oncology community 84.4% and 90.6% of the time, respectively. For questions involving treatment of HPV+ OPC, ChatGPT was clinically accurate and aligned with consensus 87.5% and 91.7% of the time, respectively. For questions regarding the HPV vaccine, ChatGPT was clinically accurate and aligned with consensus 62.5% and 75% of the time, respectively. When asked about circulating tumor DNA testing, only 12.5% of surgeons considered the responses accurate or consistent with consensus.

Conclusion: ChatGPT 3.5 performed poorly on questions involving evolving therapies and diagnostics; caution should therefore be exercised when using a platform like ChatGPT 3.5 to learn about advanced diagnostic and therapeutic technologies. Patients should be counseled on the importance of consulting their surgeons to receive accurate and up-to-date recommendations, and to use LLMs to augment their understanding of these important health-related topics.

Publisher

SAGE Publications
