Fake news detection based on a hybrid BERT and LightGBM models

Author:

Essa EhabORCID,Omar Karima,Alqahtani Ali

Abstract

AbstractWith the rapid growth of social networks and technology, knowing what news to believe and what not to believe become a challenge in this digital era. Fake news is defined as provably erroneous information transmitted intending to defraud. This kind of misinformation poses a serious threat to social cohesion and well-being, since it fosters political polarisation and can destabilize trust in the government or the service provided. As a result, fake news detection has emerged as an important field of study, with the goal of identifying whether a certain piece of content is real or fake. In this paper, we propose a novel hybrid fake news detection system that combines a BERT-based (bidirectional encoder representations from transformers) with a light gradient boosting machine (LightGBM) model. We compare the performance of the proposed method to four different classification approaches using different word embedding techniques on three real-world fake news datasets to validate the performance of the proposed method compared to other methods. The proposed method is evaluated to detect fake news based on the headline-only or full text of the news content. The results show the superiority of the proposed method for fake news detection compared to many state-of-the-art methods.

Funder

Deanship of Scientific Research, King Khalid University

Publisher

Springer Science and Business Media LLC

Subject

Computational Mathematics,Engineering (miscellaneous),Information Systems,Artificial Intelligence

Reference44 articles.

1. Lazer DMJ, Baum MA, Benkler Y, Berinsky AJ, Greenhill KM, Menczer F, Metzger MJ, Nyhan B, Pennycook G, Rothschild D, Schudson M, Sloman SA, Sunstein CR, Thorson EA, Watts DJ, Zittrain JL (2018) The science of fake news. Science 359(6380):1094–1096. https://doi.org/10.1126/science.aao2998

2. Weedon J, Nuland W, Stamos A (2017) Information operations and Facebook. Retrieved from Facebook: https://fbnewsroomus.files.wordpress.com/2017/04/facebook-and-information-operations-v1.pdf, https://about.fb.com/br/wpcontent/uploads/sites/3/2017/09/facebook-and-information-operations-v1.pdf. Accessed 15 Aug 2022

3. Gunther R, Beck PA, Nisbet EC (2018) Fake news may have contributed to trump’s 2016 victory. Ohio state university, [Online]. https://www.documentcloud.org/documents/4429952-Fake-News-May-Have-Contributed-to-Trump-s-2016.html. Accessed 24 Aug 2022

4. The Economic Times (2022) Fake news of Tesla acquiring lithium miner sent its stock up over 250%. [Online]. https://economictimes.indiatimes.com/tech/tech-bytes/fake-news-of-tesla-acquiring-lithium-miner-sent-its-stock-up-over-250/articleshow/90835997.cms. Accessed 24 Aug 2022

5. Kouzy R, Abi Jaoude J, Kraitem A, El Alam MB, Karam B, Adib E, Zarka J, Traboulsi C, Akl EW, Baddour K (2020) Coronavirus goes viral: quantifying the covid-19 misinformation epidemic on twitter. Cureus 12(3):e7255

Cited by 14 articles. 订阅此论文施引文献 订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献

1. Fake news detection: recent trends and challenges;Social Network Analysis and Mining;2024-08-30

2. BERTGuard: Two-Tiered Multi-Domain Fake News Detection with Class Imbalance Mitigation;Big Data and Cognitive Computing;2024-08-16

3. GBERT: A hybrid deep learning model based on GPT-BERT for fake news detection;Heliyon;2024-08

4. FakeWatch : a framework for detecting fake news to ensure credible elections;Social Network Analysis and Mining;2024-07-22

5. A Pipeline for Monitoring and Maintaining a Text Classification Tool in Production;Anais do LI Seminário Integrado de Software e Hardware (SEMISH 2024);2024-07-21

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3