Affiliation:
1. College of Computer Science and Engineering, Taibah University, Yanbu 966144, Saudi Arabia
2. Faculty of Science, Tanta University, Tanta 31527, Egypt
Abstract
Hate speech towards a group or an individual based on their perceived identity, such as ethnicity, religion, or nationality, is widely and rapidly spreading on social media platforms. This causes harmful impacts on users of these platforms and the quality of online shared content. Fortunately, researchers have developed different machine learning algorithms to automatically detect hate speech on social media platforms. However, most of these algorithms focus on the detection of hate speech that appears in English. There is a lack of studies on the detection of hate speech in Arabic due to the language’s complex nature. This paper aims to address this issue by proposing an effective approach for detecting Arabic hate speech on social media platforms, namely Twitter. Therefore, this paper introduces the Arabic BERT-Mini Model (ABMM) to identify hate speech on social media. More specifically, the bidirectional encoder representations from transformers (BERT) model was employed to analyze data collected from Twitter and classify the results into three categories: normal, abuse, and hate speech. In order to evaluate our model and state-of-the-art approaches, we conducted a series of experiments on Twitter data. In comparison with previous works on Arabic hate-speech detection, the ABMM model shows very promising results with an accuracy score of 0.986 compared to the other models.
Subject
Electrical and Electronic Engineering,Computer Networks and Communications,Hardware and Architecture,Signal Processing,Control and Systems Engineering
Reference40 articles.
1. ‘Misinformation? What of it?’ Motivations and individual differences in misinformation sharing on social media;Chen;Proc. Am. Soc. Inf. Sci. Technol.,2013
2. Fanning the Flames of Hate: Social Media and Hate Crime;Schwarz;J. Eur. Econ. Assoc.,2020
3. HANN: Hybrid Attention Neural Network for Detecting Covid-19 Related Rumors;Almars;IEEE Access,2022
4. Nobata, C., Tetreault, J., Thomas, A., Mehdad, Y., and Chang, Y. (2016, January 11–15). Abusive Language Detection in Online User Content. Proceedings of the 25th International Conference on World Wide Web, Montreal, QC, Canada.
5. Waseem, Z., and Hovy, D. (2016, January 12–17). Hateful Symbols or Hateful People? Predictive Features for Hate Speech Detection on Twitter. Proceedings of the NAACL-HLT, San Diego, CA, USA.
Cited by
18 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献