Enhancing Arabic Fake News Detection: Evaluating Data Balancing Techniques Across Multiple Machine Learning Models-Reference-Cited by-同舟云学术

Enhancing Arabic Fake News Detection: Evaluating Data Balancing Techniques Across Multiple Machine Learning Models

Published:2024-08-02 Issue:4 Volume:14 Page:15947-15956
ISSN:1792-8036
Container-title:Engineering, Technology & Applied Science Research
language:
Short-container-title:Eng. Technol. Appl. Sci. Res.

Author:

Aljohani Eman

Abstract

The spread of fake news has become a serious concern in the era of rapid information dissemination through social networks, especially when it comes to Arabic-language content, where automated detection systems are not as advanced as those for English-language content. This study evaluates the effectiveness of various data balancing techniques, such as class weights, random under-sampling, SMOTE, and SMOTEENN, across multiple machine learning models, namely XGBoost, Random Forest, CNN, BIGRU, BILSTM, CNN-LSTM, and CNN-BIGRU, to address the critical challenge of dataset imbalance in Arabic fake news detection. Accuracy, AUC, precision, recall, and F1-score were used to evaluate the performance of these models on balanced and imbalanced datasets. The results show that SMOTEENN greatly improves model performance, especially the F1-score, precision, and recall. In addition to advancing the larger objective of preserving information credibility on social networks, this study emphasizes the need for advanced data balancing strategies to improve Arabic fake news detection systems.

Publisher

Engineering, Technology & Applied Science Research

Reference25 articles.

1. L. Alsudias and P. Rayson, "COVID-19 and Arabic Twitter: How can Arab World Governments and Public Health Organizations Learn from Social Media?," in Proceedings of the 1st Workshop on NLP for COVID-19 at ACL 2020, Online, Apr. 2020, [Online]. Available: https://aclanthology.org/2020.nlpcovid19-acl.16.

2. B. Ahmed, G. Ali, A. Hussain, A. Baseer, and J. Ahmed, "Analysis of Text Feature Extractors using Deep Learning on Fake News," Engineering, Technology & Applied Science Research, vol. 11, no. 2, pp. 7001–7005, Apr. 2021.

3. S. Alqurashi, B. Hamoui, A. Alashaikh, A. Alhindi, and E. Alanazi, "Eating Garlic Prevents COVID-19 Infection: Detecting Misinformation on the Arabic Content of Twitter," arXiv.org, Jan. 09, 2021. https://arxiv.org/abs/2101.05626v1.

4. K. M. Fouad, S. F. Sabbeh, and W. Medhat, "Arabic fake news detection using deep learning," Computers, Materials and Continua, vol. 71, no. 2, pp. 3647–3665, 2022.

5. S. Alyoubi, M. Kalkatawi, and F. Abukhodair, "The Detection of Fake News in Arabic Tweets Using Deep Learning," Applied Sciences, vol. 13, no. 14, Jan. 2023, Art. no. 8209.