Hate speech detection in Twitter using hybrid embeddings and improved cuckoo search-based neural networks-Reference-Cited by-同舟云学术

Hate speech detection in Twitter using hybrid embeddings and improved cuckoo search-based neural networks

Published:2020-10-12 Issue:4 Volume:13 Page:485-525
ISSN:1756-378X
Container-title:International Journal of Intelligent Computing and Cybernetics
language:en
Short-container-title:IJICC

Author:

Ayo Femi Emmanuel^ORCID,Folorunso Olusegun,Ibharalu Friday Thomas,Osinuga Idowu Ademola

Abstract

PurposeHate speech is an expression of intense hatred. Twitter has become a popular analytical tool for the prediction and monitoring of abusive behaviors. Hate speech detection with social media data has witnessed special research attention in recent studies, hence, the need to design a generic metadata architecture and efficient feature extraction technique to enhance hate speech detection.Design/methodology/approachThis study proposes a hybrid embeddings enhanced with a topic inference method and an improved cuckoo search neural network for hate speech detection in Twitter data. The proposed method uses a hybrid embeddings technique that includes Term Frequency-Inverse Document Frequency (TF-IDF) for word-level feature extraction and Long Short Term Memory (LSTM) which is a variant of recurrent neural networks architecture for sentence-level feature extraction. The extracted features from the hybrid embeddings then serve as input into the improved cuckoo search neural network for the prediction of a tweet as hate speech, offensive language or neither.FindingsThe proposed method showed better results when tested on the collected Twitter datasets compared to other related methods. In order to validate the performances of the proposed method, t-test and post hoc multiple comparisons were used to compare the significance and means of the proposed method with other related methods for hate speech detection. Furthermore, Paired Sample t-Test was also conducted to validate the performances of the proposed method with other related methods.Research limitations/implicationsFinally, the evaluation results showed that the proposed method outperforms other related methods with mean F1-score of 91.3.Originality/valueThe main novelty of this study is the use of an automatic topic spotting measure based on naïve Bayes model to improve features representation.

Publisher

Emerald

Subject

General Computer Science

Reference127 articles.

1. Aggarwal, C.C. (2011), “An introduction to social network data analytics”, in Aggarwal, C.C. (Ed.), Social Network Data Analytics, Springer, New York, pp. 1-15.

2. A simple but tough-to-beat baseline for sentence embeddings,2016

3. A survey of techniques for event detection in Twitter;Computational Intelligence,2015

4. Deep learning for hate speech detection in tweets,2017

Cited by 15 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Cross-Language Offensive Speech Detection Using the mBERT Model;International Journal of Computer Science and Information Technology;2024-08-12

2. Detecting Offensive Language on Malay Social Media: A Zero-Shot, Cross-Language Transfer Approach Using Dual-Branch mBERT;Applied Sciences;2024-07-02

3. An Open Source Framework for Benchmarking Cyberbullying Detection on Social Media;2024 IEEE World AI IoT Congress (AIIoT);2024-05-29

4. Leveraging Transfer Learning for Hate Speech Detection in Portuguese Social Media Posts;IEEE Access;2024

5. Systematic meta-analysis of research on AI tools to deal with misinformation on social media during natural and anthropogenic hazards and disasters;Humanities and Social Sciences Communications;2023-06-17