A Literature Review of Textual Hate Speech Detection Methods and Datasets-Reference-Cited by-同舟云学术

A Literature Review of Textual Hate Speech Detection Methods and Datasets

Published:2022-05-26 Issue:6 Volume:13 Page:273
ISSN:2078-2489
Container-title:Information
language:en
Short-container-title:Information

Author:

Alkomah Fatimah,Ma Xiaogang^ORCID

Abstract

Online toxic discourses could result in conflicts between groups or harm to online communities. Hate speech is complex and multifaceted harmful or offensive content targeting individuals or groups. Existing literature reviews have generally focused on a particular category of hate speech, and to the best of our knowledge, no review has been dedicated to hate speech datasets. This paper systematically reviews textual hate speech detection systems and highlights their primary datasets, textual features, and machine learning models. The results of this literature review are integrated with content analysis, resulting in several themes for 138 relevant papers. This study shows several approaches that do not provide consistent results in various hate speech categories. The most dominant sets of methods combine more than one deep learning model. Moreover, the analysis of several hate speech datasets shows that many datasets are small in size and are not reliable for various tasks of hate speech detection. Therefore, this study provides the research community with insights and empirical evidence on the intrinsic properties of hate speech and helps communities identify topics for future work.

Publisher

MDPI AG

Subject

Information Systems

Link

https://www.mdpi.com/2078-2489/13/6/273/pdf

Reference159 articles.

1. Resources and benchmark corpora for hate speech detection: a systematic review

2. Change Point Detection in Terrorism-Related Online Content Using Deep Learning Derived Indicators

3. An Interdisciplinary Scientific and Mathematic Education, Addressing Relevant Social Problems Such as Sexist Hate Speech

4. A Measurement Study of Hate Speech in Social Media

5. Detection of Hate Speech Texts Using Machine Learning Algorithm

Cited by 53 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Robustness of models addressing Information Disorder: A comprehensive review and benchmarking study;Neurocomputing;2024-09

2. Capturing the Spectrum of Social Media Conflict: A Novel Multi-objective Classification Model;Proceedings of the 2024 ACM SIGIR International Conference on Theory of Information Retrieval;2024-08-02

3. Klasifikasi Hate Speech dan Emosi Dalam Teks Berbahasa Indonesia Pada Pengguna Twitter Menggunakan Metode Naïve Bayes Classifier;Indonesian Journal of Applied Technology;2024-07-26

4. Hate speech detection in the Bengali language: a comprehensive survey;Journal of Big Data;2024-07-23

5. Artificial Intelligence Applications for Workplace Safety;Advances in Information Quality and Management;2024-07-01