Public Awareness and Sentiment Analysis of COVID-Related Discussions Using BERT-Based Infoveillance

Author:

Xie Tianyi 1, Ge Yaorong 1, Xu Qian 2, Chen Shi 3

Affiliation:

1. Department of Software and Information Systems, University of North Carolina at Charlotte, Charlotte, NC 28223, USA

2. School of Communications, Elon University, Elon, NC 27244, USA

3. Department of Public Health Sciences, University of North Carolina at Charlotte, Charlotte, NC 28223, USA

Abstract

Understanding different aspects of public concerns and sentiments during large health emergencies, such as the COVID-19 pandemic, is essential for public health agencies to develop effective communication strategies, deliver up-to-date and accurate health information, and mitigate the potential impacts of emerging misinformation. Current infoveillance systems generally focus on discussion intensity (i.e., the number of relevant posts) as an approximation of public awareness, while largely ignoring the rich and diverse information in the texts themselves, which carry granular detail on varying public concerns and sentiments. In this study, we address this challenge by developing a novel natural language processing (NLP) infoveillance workflow based on bidirectional encoder representations from transformers (BERT). We first used a smaller COVID-19 tweet sample to develop content classification and sentiment analysis models using COVID-Twitter-BERT. Classification accuracy ranged from 0.77 to 0.88 across the five identified topics. In the sentiment analysis, a three-class classification task (positive/negative/neutral), BERT achieved an accuracy of 0.7. We then applied the content topic and sentiment classifiers to a much larger dataset of more than 4 million tweets spanning a 15-month period. We specifically analyzed the non-pharmaceutical intervention (NPI) and social issue content topics. There were significant differences in public awareness and sentiment towards COVID-19 overall, NPIs, and social issues across time and space. In addition, key events were identified that were associated with abrupt sentiment changes towards NPIs and social issues. This novel NLP-based AI workflow can be readily adopted for real-time granular content topic and sentiment infoveillance beyond the health context.
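The aggregation step described in the abstract, in which classified tweets are summarized by topic over time, might be sketched as follows. This is a minimal illustration and not the authors' implementation: the function name `summarize`, the numeric scoring of sentiment labels (+1/0/-1), the monthly granularity, and the sample records are all assumptions made for this example, and the topic and sentiment labels are taken as already produced by upstream classifiers.

```python
from collections import Counter, defaultdict

# Assumed mapping from the three sentiment classes to a numeric score.
# The abstract names the classes but does not specify any scoring scheme.
SENTIMENT_SCORE = {"positive": 1, "neutral": 0, "negative": -1}


def summarize(tweets):
    """Aggregate labelled tweets by (month, topic).

    `tweets` is an iterable of (month, topic, sentiment) tuples whose
    labels are assumed to come from previously trained classifiers.
    Returns, per (month, topic), the tweet count (discussion intensity)
    and the mean sentiment score.
    """
    counts = Counter()
    score_sums = defaultdict(int)
    for month, topic, sentiment in tweets:
        key = (month, topic)
        counts[key] += 1
        score_sums[key] += SENTIMENT_SCORE[sentiment]
    return {
        key: {
            "n_tweets": counts[key],
            "mean_sentiment": score_sums[key] / counts[key],
        }
        for key in sorted(counts)
    }


# Made-up records for illustration only; not data from the study.
sample = [
    ("2020-03", "NPI", "negative"),
    ("2020-03", "NPI", "neutral"),
    ("2020-03", "social issue", "negative"),
    ("2020-04", "NPI", "positive"),
]
summary = summarize(sample)
```

Reporting the tweet count alongside the mean score keeps discussion intensity and sentiment as separate signals, which mirrors the distinction the abstract draws between the two.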

Funder

Models of Infectious Disease Agents Study (MIDAS) Network through NIH/NIGMS

Publisher

MDPI AG

Subject

Industrial and Manufacturing Engineering

