Robust Benchmark for Propagandist Text Detection and Mining High-Quality Data-Reference-Cited by-同舟云学术

Robust Benchmark for Propagandist Text Detection and Mining High-Quality Data

Published:2023-06-12 Issue:12 Volume:11 Page:2668
ISSN:2227-7390
Container-title:Mathematics
language:en
Short-container-title:Mathematics

Author:

Ahmad Pir Noman¹,Liu Yuanchao¹,Ali Gauhar²^ORCID,Wani Mudasir Ahmad²^ORCID,ElAffendi Mohammed²^ORCID

Affiliation:

1. School of Computer Science and Technology, Harbin Institute of Technology, Harbin 150001, China

2. EIAS Data Science and Blockchain Lab, College of Computer and Information Sciences, Prince Sultan University, Riyadh 11586, Saudi Arabia

Abstract

Social media, fake news, and different propaganda strategies have all contributed to an increase in misinformation online during the past ten years. As a result of the scarcity of high-quality data, the present datasets cannot be used to train a deep-learning model, making it impossible to establish an identification. We used a natural language processing approach to the issue in order to create a system that uses deep learning to automatically identify propaganda in news items. To assist the scholarly community in identifying propaganda in text news, this study suggested the propaganda texts (ProText) library. Truthfulness labels are assigned to ProText repositories after being manually and automatically verified with fact-checking methods. Additionally, this study proposed using a fine-tuned Robustly Optimized BERT Pre-training Approach (RoBERTa) and word embedding using multi-label multi-class text classification. Through experimentation and comparative research analysis, we address critical issues and collaborate to discover answers. We achieved an evaluation performance accuracy of 90%, 75%, 68%, and 65% on ProText, PTC, TSHP-17, and Qprop, respectively. The big-data method, particularly with deep-learning models, can assist us in filling out unsatisfactory big data in a novel text classification strategy. We urge collaboration to inspire researchers to acquire, exchange datasets, and develop a standard aimed at organizing, labeling, and fact-checking.

Funder

Prince Sultan University

Publisher

MDPI AG

Subject

General Mathematics,Engineering (miscellaneous),Computer Science (miscellaneous)

Link

https://www.mdpi.com/2227-7390/11/12/2668/pdf

Reference89 articles.

1. Ahmed, S., Hinkelmann, K., and Corradini, F. (2022). Combating Fake News with Computational Intelligence Techniques, Springer.

2. Hao, F., Yang, Y., Shang, J., and Park, D.-S. (2023). IEEE Transactions on Computational Social Systems, IEEE.

3. Propaganda Detection And Challenges Managing Smart Cities Information On Social Media;Ahmad;EAI Endorsed Trans. Smart Cities,2023

4. Khanday, A.M.U.D., Wani, M.A., Rabani, S.T., and Khan, Q.R. (2023). Hybrid Approach for Detecting Propagandistic Community and Core Node on Social Networks. Sustainability, 15.

5. Proppy: Organizing the News Based on Their Propagandistic Content;Jaradat;Inf. Process. Manag.,2019

Cited by 2 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. DJMF: A discriminative joint multi-task framework for multimodal sentiment analysis based on intra- and inter-task dynamics;Expert Systems with Applications;2024-05

2. Enhancing Coherence and Diversity in Multi-Class Slogan Generation Systems;ACM Transactions on Asian and Low-Resource Language Information Processing;2023-12-15