Novel Hate Speech Detection Using Word Cloud Visualization and Ensemble Learning Coupled with Count Vectorizer

Published:2022-06-29 Issue:13 Volume:12 Page:6611
ISSN:2076-3417
Container-title:Applied Sciences
language:en
Short-container-title:Applied Sciences

Author:

Turki Turki, Roy Sanjiban Sekhar

Abstract

A plethora of negative behavioural activities have recently been found in social media. Incidents such as trolling and hate speech on social media, especially on Twitter, have grown considerably. Therefore, detection of hate speech on Twitter has become an area of interest among many researchers. In this paper, we present a computational framework to (1) examine the computational challenges behind hate speech detection and (2) generate high-performance results. First, we extract features from Twitter data by utilizing a count vectorizer technique. Then, we provide the labeled dataset of constructed features to the adopted ensemble methods, including Bagging, AdaBoost, and Random Forest. After training, we classify new tweet examples into one of two categories, hate speech or non-hate speech. Experimental results show (1) that Random Forest surpassed the other methods, achieving 95% accuracy, and (2) that the word cloud displays the most prominent tweets that are responsible for hateful sentiments.
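The count vectorizer step mentioned in the abstract can be illustrated with a minimal standard-library sketch. This is not the authors' implementation: the function names (`tokenize`, `fit_vocabulary`, `transform`), the tokenization rule, and the toy tweets are all assumptions made for illustration. Libraries such as scikit-learn provide a production-grade `CountVectorizer`, whose output would then be passed to ensemble classifiers like those named above.

```python
from collections import Counter
import re


def tokenize(text):
    """Lowercase the text and split it into simple word tokens (assumed rule)."""
    return re.findall(r"[a-z0-9']+", text.lower())


def fit_vocabulary(documents):
    """Map each distinct token to a column index, in sorted token order."""
    tokens = {tok for doc in documents for tok in tokenize(doc)}
    return {tok: idx for idx, tok in enumerate(sorted(tokens))}


def transform(documents, vocabulary):
    """Return one row of token counts per document; unseen tokens are skipped."""
    rows = []
    for doc in documents:
        counts = Counter(tokenize(doc))
        row = [0] * len(vocabulary)
        for tok, n in counts.items():
            if tok in vocabulary:
                row[vocabulary[tok]] = n
        rows.append(row)
    return rows


# Hypothetical toy tweets, not taken from the paper's dataset.
tweets = ["You are great great people", "you are awful"]
vocab = fit_vocabulary(tweets)
matrix = transform(tweets, vocab)
```

Each row of `matrix` is a fixed-length feature vector for one tweet, which is the form a classifier expects. The same per-token counts, summed over a class of tweets, are also the kind of frequency data a word cloud is typically drawn from.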

Funder

King Abdulaziz University

Publisher

MDPI AG

Subject

Fluid Flow and Transfer Processes,Computer Science Applications,Process Chemistry and Technology,General Engineering,Instrumentation,General Materials Science

Link

https://www.mdpi.com/2076-3417/12/13/6611/pdf

Reference 46 articles.

1. An Ensemble Method for Radicalization and Hate Speech Detection Online Empowered by Sentic Computing

2. Hate speech detection: Challenges and solutions

3. Offensive language detection on social media based on text classification;Hajibabaee;Proceedings of the 2022 IEEE 12th Annual Computing and Communication Workshop and Conference (CCWC),2022

4. Machine Learning and feature engineering-based study into sarcasm and irony classification with application to cyberbullying detection

5. Detection and fine-grained classification of cyberbullying events;Van Hee;Proceedings of the International Conference Recent Advances in Natural Language Processing, RANLP,2015

Cited by 22 articles.

1. CLASSIFICATION OF CUSTOMER SENTIMENTS BASED ON ONLINE REVIEWS: COMPARATIVE ANALYSIS OF MACHINE LEARNING AND DEEP LEARNING ALGORITHMS;Kahramanmaraş Sütçü İmam Üniversitesi Mühendislik Bilimleri Dergisi;2024-09-03

2. Leveraging Domain-Specific Word Embedding and Hate Concepts in Hate Speech Detection;2024 International Joint Conference on Neural Networks (IJCNN);2024-06-30

3. Ebola optimization based spiking neural network for automatic hate speech recognition;International Journal of Information Technology;2024-06-26

4. Application of Natural Language Processing and Genetic Algorithm to Fine-Tune Hyperparameters of Classifiers for Economic Activities Analysis;Big Data and Cognitive Computing;2024-06-13

5. A sentiment analysis approach for understanding users’ perception of metaverse marketplace;Intelligent Systems with Applications;2024-06