Assessment of Artificial Intelligence Chatbot Responses to Top Searched Queries About Cancer-Reference-Cited by-同舟云学术

Assessment of Artificial Intelligence Chatbot Responses to Top Searched Queries About Cancer

Published:2023-10-01 Issue:10 Volume:9 Page:1437
ISSN:2374-2437
Container-title:JAMA Oncology
language:en
Short-container-title:JAMA Oncol

Author:

Pan Alexander¹,Musheyev David¹,Bockelman Daniel¹,Loeb Stacy²³⁴,Kabarriti Abdo E.¹

Affiliation:

1. Department of Urology, State University of New York Downstate Health Sciences University, New York

2. Department of Urology, New York University School of Medicine, New York

3. Department of Population Health, New York University School of Medicine, New York

4. Department of Surgery, VA New York Harbor Health Care, New York

Abstract

ImportanceConsumers are increasingly using artificial intelligence (AI) chatbots as a source of information. However, the quality of the cancer information generated by these chatbots has not yet been evaluated using validated instruments.ObjectiveTo characterize the quality of information and presence of misinformation about skin, lung, breast, colorectal, and prostate cancers generated by 4 AI chatbots.Design, Setting, and ParticipantsThis cross-sectional study assessed AI chatbots’ text responses to the 5 most commonly searched queries related to the 5 most common cancers using validated instruments. Search data were extracted from the publicly available Google Trends platform and identical prompts were used to generate responses from 4 AI chatbots: ChatGPT version 3.5 (OpenAI), Perplexity (Perplexity.AI), Chatsonic (Writesonic), and Bing AI (Microsoft).ExposuresGoogle Trends’ top 5 search queries related to skin, lung, breast, colorectal, and prostate cancer from January 1, 2021, to January 1, 2023, were input into 4 AI chatbots.Main Outcomes and MeasuresThe primary outcomes were the quality of consumer health information based on the validated DISCERN instrument (scores from 1 [low] to 5 [high] for quality of information) and the understandability and actionability of this information based on the understandability and actionability domains of the Patient Education Materials Assessment Tool (PEMAT) (scores of 0%-100%, with higher scores indicating a higher level of understandability and actionability). Secondary outcomes included misinformation scored using a 5-item Likert scale (scores from 1 [no misinformation] to 5 [high misinformation]) and readability assessed using the Flesch-Kincaid Grade Level readability score.ResultsThe analysis included 100 responses from 4 chatbots about the 5 most common search queries for skin, lung, breast, colorectal, and prostate cancer. The quality of text responses generated by the 4 AI chatbots was good (median [range] DISCERN score, 5 [2-5]) and no misinformation was identified. Understandability was moderate (median [range] PEMAT Understandability score, 66.7% [33.3%-90.1%]), and actionability was poor (median [range] PEMAT Actionability score, 20.0% [0%-40.0%]). The responses were written at the college level based on the Flesch-Kincaid Grade Level score.Conclusions and RelevanceFindings of this cross-sectional study suggest that AI chatbots generally produce accurate information for the top cancer-related search queries, but the responses are not readily actionable and are written at a college reading level. These limitations suggest that AI chatbots should be used supplementarily and not as a primary source for medical information.

Publisher

American Medical Association (AMA)

Subject

Oncology,Cancer Research

Link

https://jamanetwork.com/journals/jamaoncology/articlepdf/2808733/jamaoncology_pan_2023_br_230013_1697125931.23609.pdf

Reference10 articles.

1. Performance of ChatGPT on USMLE: potential for AI-assisted medical education using large language models.;Kung;PLOS Digit Health,2023

2. Cancer statistics, 2023.;Siegel;CA Cancer J Clin,2023

3. DISCERN: an instrument for judging the quality of written consumer health information on treatment choices.;Charnock;J Epidemiol Community Health,1999

4. Development of the Patient Education Materials Assessment Tool (PEMAT): a new measure of understandability and actionability for print and audiovisual patient information.;Shoemaker;Patient Educ Couns,2014

5. Dissemination of misinformative and biased information about prostate cancer on YouTube.;Loeb;Eur Urol,2019

Cited by 45 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Can Patients With Urogenital Cancer Rely on Artificial Intelligence Chatbots for Treatment Decisions?;Clinical Genitourinary Cancer;2024-12

2. Assessing the accuracy and reliability of ChatGPT’s medical responses about thyroid cancer;International Journal of Medical Informatics;2024-11

3. Decoding medical jargon: The use of AI language models (ChatGPT-4, BARD, microsoft copilot) in radiology reports;Patient Education and Counseling;2024-09

4. Using Large Language Models to Generate Educational Materials on Childhood Glaucoma;American Journal of Ophthalmology;2024-09

5. Can AI chatbots accurately answer patient questions regarding vasectomies?;International Journal of Impotence Research;2024-08-24