Social Media Hate Speech Detection Using Explainable Artificial Intelligence (XAI)-Reference-Cited by-同舟云学术

Social Media Hate Speech Detection Using Explainable Artificial Intelligence (XAI)

Published:2022-08-17 Issue:8 Volume:15 Page:291
ISSN:1999-4893
Container-title:Algorithms
language:en
Short-container-title:Algorithms

Author:

Mehta Harshkumar,Passi Kalpdrum^ORCID

Abstract

Explainable artificial intelligence (XAI) characteristics have flexible and multifaceted potential in hate speech detection by deep learning models. Interpreting and explaining decisions made by complex artificial intelligence (AI) models to understand the decision-making process of these model were the aims of this research. As a part of this research study, two datasets were taken to demonstrate hate speech detection using XAI. Data preprocessing was performed to clean data of any inconsistencies, clean the text of the tweets, tokenize and lemmatize the text, etc. Categorical variables were also simplified in order to generate a clean dataset for training purposes. Exploratory data analysis was performed on the datasets to uncover various patterns and insights. Various pre-existing models were applied to the Google Jigsaw dataset such as decision trees, k-nearest neighbors, multinomial naïve Bayes, random forest, logistic regression, and long short-term memory (LSTM), among which LSTM achieved an accuracy of 97.6%. Explainable methods such as LIME (local interpretable model—agnostic explanations) were applied to the HateXplain dataset. Variants of BERT (bidirectional encoder representations from transformers) model such as BERT + ANN (artificial neural network) with an accuracy of 93.55% and BERT + MLP (multilayer perceptron) with an accuracy of 93.67% were created to achieve a good performance in terms of explainability using the ERASER (evaluating rationales and simple English reasoning) benchmark.

Publisher

MDPI AG

Subject

Computational Mathematics,Computational Theory and Mathematics,Numerical Analysis,Theoretical Computer Science

Link

https://www.mdpi.com/1999-4893/15/8/291/pdf

Reference35 articles.

1. Automated Hate Speech Detection and the Problem of Offensive Language http://arxiv.org/abs/1703.04009

2. Detecting Offensive Language in Social Media to Protect Adolescent Online Safety

3. Necessity and sufficiency for explaining text classifiers: A case study in hate speech detection;Balkir;arXiv,2022

4. Mean Birds

Cited by 26 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Robustness of models addressing Information Disorder: A comprehensive review and benchmarking study;Neurocomputing;2024-09

2. Natural Language Processing–Powered Real-Time Monitoring Solution for Vaccine Sentiments and Hesitancy on Social Media: System Development and Validation;JMIR Medical Informatics;2024-06-21

3. Methods, Techniques, and Application of Explainable Artificial Intelligence;Advances in Environmental Engineering and Green Technologies;2024-06-07

4. Abusive Comment Detection in Tamil Code-Mixed Data by Adjusting Class Weights and Refining Features;ACM Transactions on Asian and Low-Resource Language Information Processing;2024-05-18

5. Study of Deep Learning Techniques for Real-Time Online censorship using Comment Toxicity Detection;2024 MIT Art, Design and Technology School of Computing International Conference (MITADTSoCiCon);2024-04-25