Abstract
It is essential for research communities to investigate ways for authenticating news. The use of linguistic feature based analysis to automatically detect false news is gaining popularity among the scientific community. However, such techniques are exclusively created for English, leaving low-resource languages like Hindi behind. To address this issue, we constructed a novel annotated Hindi Fake News (HinFakeNews) dataset of roughly 33,300 articles that can be utilized to develop autonomous fake news detection systems. This work provides a two-stage benchmark model for identifying fake news in Hindi using machine learning. The proposed model, LFWE (Linguistic Feature Based Word Embedding), generates word embedding over linguistic features. This article focuses on 23 key linguistic features (15 extracted and 08 derived) for successful detection of Hindi fake news. These features are grouped as lexical, semantic, syntactic, psycho-linguistic, readability, and quantity features. The contribution is twofold. In the first phase, the dataset is preprocessed and linguistic features are extracted. In the second phase, feature sets are generated as word embeddings, and an Ensemble voting classification is carried out on the feature sets. According to experimental findings, the LFWE model accurately detects and classifies fake news in Hindi with an accuracy of 98.49%.
Publisher
Association for Computing Machinery (ACM)
Reference51 articles.
1. S. Rukmini. 2019. In India who speaks in English and where? from. https://www.livemint.com/news/india/in-india-who-speaks-in-english-and-where-1557814101428.html.
2. Language-Independent Fake News Detection: English, Portuguese, and Spanish Mutual Features
3. Syeda Zainab Akbar, Divyanshu Kukreti, Somya Sagarika, and Joyojeet Pal. 2020. Temporal Patterns in COVID-19 Related Digital Misinformation in India. Retrieved April 7, 2023 from http://joyojeet.people.si.umich.edu/temporal-patterns-in-covid-19-misinformation-in-india/.
4. Where is Your Evidence: Improving Fact-checking by Justification Modeling
5. Social Media and Fake News in the 2016 Election
Cited by
3 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献