Affiliation:
1. Text Information Processing Laboratory, Kitami Institute of Technology, Kitami 090-8507, Japan
2. School of Information Technology, Oulu University of Applied Sciences, 90570 Oulu, Finland
Abstract
False news refers to false, fake, or misleading information presented as real news. In recent years, there has been a noticeable increase in false news on the Internet. The goal of this paper was to study the automatic detection of such false news using machine learning and natural language processing techniques and to determine which techniques work the most effectively. This article first studies what constitutes false news and how it differs from other types of misleading information. We also study the results achieved by other researchers on the same topic. After building a foundation to understand false news and the various ways of automatically detecting it, this article provides its own experiments. These experiments were carried out on four different datasets, one that was made just for this article, using 10 different machine learning methods. The results of this article were satisfactory and provided answers to the original research questions set up at the beginning of this article. This article could determine from the experiments that passive aggressive algorithms, support vector machines, and random forests are the most efficient methods for automatic false news detection. This article also concluded that more complex experiments, such as using multiple levels of identifying false news or detecting computer-generated false news, require more complex machine learning models.
Reference56 articles.
1. Gruener, S. (2024, August 12). An Empirical Study on False News on Internet-Based False News Stories: Experiences, Problem Awareness, and Responsibilities. Available online: https://ssrn.com/abstract=3351911.
2. Hitlin, P. (2024, August 12). False Reporting on the Internet and the Spread of Rumors: Three Case Studies. Gnovis Journal. Georgetown University: Washington, DC, USA. Available online: http://pascalfroissart.online.fr/3-cache/2004-hitlin.pdf.
3. “Fake news” is not simply false information: A concept explication and taxonomy of online content;Molina;Am. Behav. Sci.,2021
4. Wang, W.Y. (2017). “liar, liar pants on fire”: A new benchmark dataset for fake news detection. arXiv.
5. Supervised Learning for Fake News Detection;Reis;IEEE Intell. Syst.,2019