Topic Modeling of Health Rumors Based on Anti-Rumor Tweets on the WeChat Platform: Machine Learning Analysis (Preprint)

Author:

Li ZiyuORCID,Wu XiaoqianORCID,Xu LinORCID,Liu MingORCID,Huang ChengORCID

Abstract

BACKGROUND

Social network has become one of the main channels for the public to obtain health information. However, it has also become a source for the spread of health-related misinformation. Health-related misinformation seriously threatens the public’s physical and mental health. Topic identification is the premise of health-related misinformation governance.

OBJECTIVE

This study aims to explore the types of health-related misinformation favored by WeChat public platform users and their prevalence trends.

METHODS

We used a web crawler tool to capture health rumor-dispelling tweets collected on the rumor-dispelling public account. We collected text information from health-debunking tweets posted between January 1, 2016, and August 31, 2022. Following word segmentation of the collected text, a document topic generation model, Latent Dirichlet Assignment, is used to identify and generalize the most common topics. The proportion distribution of themes was calculated, and the negative impact of various health rumors in different periods was subsequently analyzed. Additionally, the prevalence of health rumors was analyzed using the number of health rumors generated at each time point.

RESULTS

From January 1, 2016 to August 31, 2022, we collected 9,366 rumor-refuting tweets from WeChat official accounts. Through topic modeling, we divided the health rumors into eight topics, including the prevention and treatment of infectious diseases (n = 1,284, 13.71%), disease therapy and its effects (n = 1,037, 11.07%), food safety (n = 1,243, 13.27%), cancer and its causes (n = 946, 10.10%), regimen and disease (n = 1,540, 16.44%), rumors of transmission (n = 914, 9.76%), healthy diet (n = 1,068, 11.40%), and nutrition and health (n = 1,334, 14.24%). Furthermore, we summarized the eight topics into four themes, including public health, disease, diet and health, and rumor spreading.

CONCLUSIONS

Our study shows that the topic model can provide analysis and insights into health rumor governance. The analysis of rumor development trends shows that public health, disease, and diet and health problems are the most affected areas of rumors. Governments still need to consider national conditions, formulate appropriate policies, and deal with health rumors more comprehensively. While ensuring the health of the Internet, we should also improve the level of national quality education. We recommend that additional sentiment analysis-related studies be conducted to verify the impact of health rumor-related topics.

Publisher

JMIR Publications Inc.

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3