Affiliation:
1. GLA University Mathura Mathura India
Abstract
AbstractFake news classification emerged as an exciting topic for machine learning and artificial intelligence researchers. Most of the existing literature on fake news detection is based on the English language. Hence, it needs more usability. Fake news detection in low‐resource scare languages is still challenging due to the absence of large annotated datasets and tools. In this work, we propose a large‐scale Indian news dataset for the Hindi language. This dataset is constructed by scraping different reliable fact‐checking websites. The LDA approach is adopted to assign the category to news statements. Various machine‐learning and transfer learning approaches are applied to verify the authenticity of the dataset. Ensemble learning is also applied based on the low false‐positive rate of machine‐learning classifiers. A multi‐modal approach is adopted by combining LSTM with VGG‐16 and VGG‐19 classifiers. LSTM is used for textual features, while VGG‐16 and VGG‐19 are applied for image analysis. Our proposed dataset has achieved satisfactory performance.
Funder
Council of Science and Technology, U.P.
Reference19 articles.
1. https://wiki2.org/en/List_of_languages_by_number_of_native_speakers
2. Role of cybersecurity and Blockchain in battlefield of things;Sharma G;Int Technol Lett,2023
3. Times of India. Accesssed 12 june 2023https://timesofindia.indiatimes.com/tpoint_cmtofart/8693029.cms?msid=8693029
4. GuptaA SukumaranR JohnK TekiS.Hostility detection and COVID‐19 fake news detection in social media.2021arXiv preprint arXiv:2101.05953.
5. KamalO KumarA VaidhyaT.Hostility detection in hindi leveraging pre‐trained language models.2021arXiv preprint arXiv:2101.05494.
Cited by
1 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献