Abstract
THIS ARTICLE USES WORDS OR LANGUAGE THAT IS CONSIDERED PROFANE, VULGAR, OR OFFENSIVE BY SOME READERS. Hate speech detection in online social networks is a multidimensional problem, dependent on language and cultural factors. Most supervised learning resources for this task, such as labeled datasets and Natural Language Processing (NLP) tools, have been specifically tailored for English. However, a large portion of web users around the world speak different languages, creating an important need for efficient multilingual hate speech detection approaches. In particular, such approaches should be able to leverage the limited cross-lingual resources currently existing in their learning process. The cross-lingual transfer in this task has been difficult to achieve successfully. Therefore, we propose a simple yet effective method to approach this problem. To our knowledge, ours is the first attempt to create a multilingual embedding model specific to this problem. We validate the effectiveness of our approach by performing an extensive comparative evaluation against several well-known general-purpose language models that, unlike ours, have been trained on massive amounts of data. We focus on a zero-shot cross-lingual evaluation scenario in which we classify hate speech in one language without having access to any labeled data. Despite its simplicity, our embeddings outperform more complex models for most experimental settings we tested. In addition, we provide further evidence of the effectiveness of our approach through an ad hoc qualitative exploratory analysis, which captures how hate speech is displayed in different languages. This analysis allows us to find new cross-lingual relations between words in the hate-speech domain. Overall, our findings indicate common patterns in how hate speech is expressed across languages and that our proposed model can capture such relationships significantly.
Publisher
Public Library of Science (PLoS)
Reference62 articles.
1. Badjatiya, Pinkesh and Gupta, Shashank and Gupta, Manish and Varma, Vasudeva. Deep learning for hate speech detection in tweets. Proc. 26th WWW Companion. 759–760. 2017
2. Sweta Agrawal and Amit Awekar. Deep Learning for Detecting Cyberbullying Across Multiple Social Media Platforms. Proc. 40th ECIR. 141–153. 2018.
3. Despoina Chatzakou, Nicolas Kourtellis, Jeremy Blackburn, Emiliano De Cristofaro, Gianluca Stringhini, Athena Vakali. Mean Birds: Detecting Aggression and Bullying on Twitter. Proc. ACM on Web Science Conference, WebSci. 13–22. 2017.
4. Hosseinmardi, Homa and Mattson, Sabrina Arredondo and Rafiq, Rahat Ibn and Han, Richard and Lv, Qin and Mishra, Shivakant. Analyzing labeled cyberbullying incidents on the Instagram social network. International Conference on Social Informatics. 49–66. 2015.
5. Lukas Stappen, Fabian Brunn, Björn W. Schuller. Cross-lingual Zero- and Few-shot Hate Speech Detection Utilising Frozen Transformer Language Models and AXEL. CoRR. abs/2004.13850.