Illusion of Truth: Analysing and Classifying COVID-19 Fake News in Brazilian Portuguese Language-Reference-Cited by-同舟云学术

Illusion of Truth: Analysing and Classifying COVID-19 Fake News in Brazilian Portuguese Language

Published:2022-04-01 Issue:2 Volume:6 Page:36
ISSN:2504-2289
Container-title:Big Data and Cognitive Computing
language:en
Short-container-title:BDCC

Author:

Endo Patricia Takako^ORCID,Santos Guto Leoni,de Lima Xavier Maria Eduarda,Nascimento Campos Gleyson Rhuan,de Lima Luciana Conceição,Silva Ivanovitch^ORCID,Egli Antonia^ORCID,Lynn Theo^ORCID

Abstract

Public health interventions to counter the COVID-19 pandemic have accelerated and increased digital adoption and use of the Internet for sourcing health information. Unfortunately, there is evidence to suggest that it has also accelerated and increased the spread of false information relating to COVID-19. The consequences of misinformation, disinformation and misinterpretation of health information can interfere with attempts to curb the virus, delay or result in failure to seek or continue legitimate medical treatment and adherence to vaccination, as well as interfere with sound public health policy and attempts to disseminate public health messages. While there is a significant body of literature, datasets and tools to support countermeasures against the spread of false information online in resource-rich languages such as English and Chinese, there are few such resources to support Portuguese, and Brazilian Portuguese specifically. In this study, we explore the use of machine learning and deep learning techniques to identify fake news in online communications in the Brazilian Portuguese language relating to the COVID-19 pandemic. We build a dataset of 11,382 items comprising data from January 2020 to February 2021. Exploratory data analysis suggests that fake news about the COVID-19 vaccine was prevalent in Brazil, much of it related to government communications. To mitigate the adverse impact of fake news, we analyse the impact of machine learning to detect fake news based on stop words in communications. The results suggest that stop words improve the performance of the models when keeping them within the message. Random Forest was the machine learning model with the best results, achieving 97.91% of precision, while Bi-GRU was the best deep learning model with an F1 score of 94.03%.

Publisher

MDPI AG

Subject

Artificial Intelligence,Computer Science Applications,Information Systems,Management Information Systems

Link

https://www.mdpi.com/2504-2289/6/2/36/pdf

Reference114 articles.

1. The internet as a source of health information and services;Bujnowska-Fedak,2019

2. Online Nation 2020 Reporthttps://www.ofcom.org.uk/__data/assets/pdf_file/0027/196407/online-nation-2020-report.pdf

3. Sorting the Healthy Diet Signal from the Social Media Expert Noise: Preliminary Evidence from the Healthy Diet Discourse on Twitter

4. Information exchange in social networks for health care

Cited by 7 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Automatic detection of fake tweets about the COVID-19 Vaccine in Portuguese;Social Network Analysis and Mining;2024-03-08

2. Sentiment Analysis in the Age of COVID-19: A Bibliometric Perspective;Information;2023-12-13

3. Artificial intelligence applied to analyzes during the pandemic: COVID-19 beds occupancy in the state of Rio Grande do Norte, Brazil;Frontiers in Artificial Intelligence;2023-12-08

4. Machine Learning-Based Identifications of COVID-19 Fake News Using Biomedical Information Extraction;Big Data and Cognitive Computing;2023-03-07

5. A Systematic Literature Review and Meta-Analysis of Studies on Online Fake News Detection;Information;2022-11-04