Detection of Hate Speech, Racism and Misogyny in Digital Social Networks: Colombian Case Study-Reference-Cited by-同舟云学术

Detection of Hate Speech, Racism and Misogyny in Digital Social Networks: Colombian Case Study

Published:2024-09-06 Issue:9 Volume:8 Page:113
ISSN:2504-2289
Container-title:Big Data and Cognitive Computing
language:en
Short-container-title:BDCC

Author:

Moreno-Sandoval Luis Gabriel¹^ORCID,Pomares-Quimbaya Alexandra¹^ORCID,Barbosa-Sierra Sergio Andres¹^ORCID,Pantoja-Rojas Liliana Maria²^ORCID

Affiliation:

1. Engineering Faculty, Pontificia Universidad Javeriana, Bogotá 110231, Colombia

2. Engineering Faculty, Universidad Distrital Francisco José de Caldas, Bogotá 111611, Colombia

Abstract

The growing popularity of social networking platforms worldwide has substantially increased the presence of offensive language on these platforms. To date, most of the systems developed to mitigate this challenge focus primarily on English content. However, this issue is a global concern, and therefore, other languages, such as Spanish, are involved. This article addresses the task of identifying hate speech, racism, and misogyny in Spanish within the Colombian context on social networks, and introduces a gold standard dataset specifically developed for this purpose. Indeed, the experiment compares the performance of TLM models from Deep Learning methods, such as BERT, Roberta, XLM, and BETO adjusted to the Colombian slang domain, then compares the best TLM model against a GPT, having a significant impact on achieving more accurate predictions in this task. Finally, this study provides a detailed understanding of the different components used in the system, including the architecture of the models and the selection of functions. The best results show that the BERT model achieves an accuracy of 83.6% for hate speech detection, while the GPT model achieves an accuracy of 90.8% for racism speech and 90.4% for misogyny detection.

Funder

Pontificia Universidad Javeriana

Publisher

MDPI AG

Link

https://www.mdpi.com/2504-2289/8/9/113/pdf

Reference83 articles.

1. Ash Turner (2023, November 15). How Many Users Does Twitter Have?. Available online: https://www.bankmycell.com/blog/how-many-users-does-twitter-have.

2. LibertiesEU (2023, May 25). Freedom of Expression on Social Media: Filtering Methods, Rights, and Future Perspectives. Available online: https://www.liberties.eu/es/stories/libertad-expresion-redes-sociales/43773.

3. Zhang, Z., and Luo, L. (2018). Hate Speech Detection: A Solved Problem? The Challenging Case of Long Tail on Twitter. arXiv.

4. Misogyny Detection in Twitter: A Multilingual and Cross-Domain Study;Pamungkas;Inf. Process. Manag.,2020

5. International Telecommunication Union, ITU Publications (2024, April 05). Measuring Digital Development: Facts and Figures 2022. Available online: https://www.itu.int/hub/publication/d-ind-ict_mdd-2022/.