Fact Check: Assessing the Response of ChatGPT to Alzheimer’s Disease Statements with Varying Degrees of Misinformation

Author:

Huang Sean S.ORCID,Song Qingyuan,Beiting Kimberly J.,Duggan Maria C.,Hines Kristin,Murff Harvey,Leung Vania,Powers James,Harvey T.S.,Malin Bradley,Yin Zhijun

Abstract

AbstractBackgroundThere are many myths regarding Alzheimer’s disease (AD) that have been circulated on the Internet, each exhibiting varying degrees of accuracy, inaccuracy, and misinformation. Large language models such as ChatGPT, may be a useful tool to help assess these myths for veracity and inaccuracy. However, they can induce misinformation as well. The objective of this study is to assess ChatGPT’s ability to identify and address AD myths with reliable information.MethodsWe conducted a cross-sectional study of clinicians’ evaluation of ChatGPT (GPT 4.0)’s responses to 20 selected AD myths. We prompted ChatGPT to express its opinion on each myth and then requested it to rephrase its explanation using a simplified language that could be more readily understood by individuals with a middle school education. We implemented a survey using Redcap to determine the degree to which clinicians agreed with the accuracy of each ChatGPT’s explanation and the degree to which the simplified rewriting was readable and retained the message of the original. We also collected their explanation on any disagreement with ChatGPT’s responses. We used five Likert-type scale with a score ranging from -2 to 2 to quantify clinicians’ agreement in each aspect of the evaluation.ResultsThe clinicians (n=11) were generally satisfied with ChatGPT’s explanations, with a mean (SD) score of 1.0(±0.3) across the 20 myths. While ChatGPT correctly identified that all the 20 myths were inaccurate, some clinicians disagreed with its explanations on 7 of the myths.Overall, 9 of the 11 professionals either agreed or strongly agreed that ChatGPT has the potential to provide meaningful explanations of certain myths.ConclusionsThe majority of surveyed healthcare professionals acknowledged the potential value of ChatGPT in mitigating AD misinformation. However, the need for more refined and detailed explanations of the disease’s mechanisms and treatments was highlighted.Impact StatementThere are many statements regarding Alzheimer’s disease (AD) diagnosis, management, and treatment circulating on the Internet, each exhibiting varying degrees of accuracy, inaccuracy, and misinformation. Large language models are a popular topic currently, and many patients and caregivers may turn to LLMs such as ChatGPT to learn more about the disease. This study aims to assess ChatGPT’s ability to identify and address AD myths with reliable information. We certify that this work is novel.Key Points-Geriatricians acknowledged the potential value of ChatGPT in mitigating misinformation in Alzheimer’s Disease-There remain nuanced cases where ChatGPT explanations are not as refined or appropriate.-Why does this matter? Large language models such as ChatGPT are very popular nowadays and patients and caregivers often may use them to learn about their disease. The paper seeks to determine whether ChatGPT does an appropriate job in moderating understanding of Alzheimer’s Disease myths.

Publisher

Cold Spring Harbor Laboratory

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3