A comparative analysis of machine learning algorithms for hate speech detection in social media

Published:2023-10-01 Issue:4 Volume:13 Page:e202348
ISSN:1986-3497
Container-title:Online Journal of Communication and Media Technologies
Short-container-title:ONLINE J COMMUN MEDIA TECHNOL

Author:

Omran Esraa¹, Al Tararwah Estabraq², Al Qundus Jamal³

Affiliation:

1. Center for Applied Mathematics and Bioinformatics, Department of Computer Science, Gulf University for Science and Technology, Kuwait City, KUWAIT

2. Gulf University for Science and Technology, Kuwait City, KUWAIT

3. Faculty of Information Technology, Middle East University, Amman, JORDAN

Abstract

Detecting and mitigating hate speech in social media, particularly on platforms like Twitter, is a crucial task with significant societal impact. This research study presents a comprehensive comparative analysis of machine learning algorithms for hate speech detection, with the primary goal of identifying an optimal algorithmic combination that is simple, easy to implement, efficient, and yields high detection performance. Through meticulous pre-processing and rigorous evaluation, the study explores various algorithms to determine their suitability for hate speech detection. The focus is on finding a combination that balances simplicity, ease of implementation, computational efficiency, and strong performance metrics. The findings reveal that the combination of naïve Bayes and decision tree algorithms achieves a high accuracy of 0.887 and an F1-score of 0.885, demonstrating its effectiveness in hate speech detection. This research contributes to identifying a reliable algorithmic combination that meets the criteria of simplicity, ease of implementation, quick processing, and strong performance, providing valuable guidance for researchers and practitioners in hate speech detection in social media. By elucidating the strengths and limitations of various algorithmic combinations, this research enhances the understanding of hate speech detection. It paves the way for developing robust solutions, creating a safer, more inclusive digital environment.

Publisher

Bastas Publications

Subject

Computer Science Applications, Media Technology, Education, Communication

Link

https://www.ojcmt.net/download/a-comparative-analysis-of-machine-learning-algorithms-for-hate-speech-detection-in-social-media-13603.pdf

References: 26 articles.

1. Anand, M., Sahay, K. B., Ahmed, M. A., Sultan, D., Chandan, R. R., & Singh, B. (2023). Deep learning and natural language processing in computation for offensive language detection in online social networks by feature selection and ensemble classification techniques. Theoretical Computer Science, 943, 203-218. https://doi.org/10.1016/j.tcs.2022.06.020

2. Bansal, M., Goyal, A., & Choudhary, A. (2022). A comparative analysis of k-nearest neighbor, genetic, support vector machine, decision tree, and long short term memory algorithms in machine learning. Decision Analytics Journal, 3, 100071. https://doi.org/10.1016/j.dajour.2022.100071

3. Connolly, T. M., & Begg, C. E. (2005). Database systems: A practical approach to design, implementation, and management. Pearson Education.

4. Das, S., Bhattacharyya, K., & Sarkar, S. (2023). Performance analysis of logistic regression, naïve Bayes, KNN, decision tree, random forest and SVM on hate speech detection from Twitter. International Research Journal of Innovations in Engineering and Technology, 7(3), 24-28.

5. Davidson, T., Warmsley, D., Macy, M., & Weber, I. (2017). Automated hate speech detection and the problem of offensive language. Proceedings of the International AAAI Conference on Web and Social Media, 11(1), 512-515. https://doi.org/10.1609/icwsm.v11i1.14955