Offensive Comments in the Brazilian Web: a dataset and baseline results-Reference-Cited by-同舟云学术

Offensive Comments in the Brazilian Web: a dataset and baseline results

Published:2017-07-06 Issue: Volume: Page:
ISSN:
Container-title:Anais do Brazilian Workshop on Social Network Analysis and Mining (BraSNAM)
language:
Short-container-title:

Author:

De Pelle Rogers Prates,Moreira Viviane P.

Abstract

Brazilian Web users are among the most active in social networks and very keen on interacting with others. Offensive comments, known as hate speech, have been plaguing online media and originating a number of lawsuits against companies which publish Web content. Given the massive number of user generated text published on a daily basis, manually filtering offensive comments becomes infeasible. The identification of offensive comments can be treated as a supervised classification task. In order to obtain a model to classify comments, an annotated dataset containing positive and negative examples is necessary. The lack of such a dataset in Portuguese, limits the development of detection approaches for this language. In this paper, we describe how we created annotated datasets of offensive comments for Portuguese by collecting news comments on the Brazilian Web. In addition, we provide classification results achieved by standard classification algorithms on these datasets which can serve as baseline for future work on this topic.

Publisher

Sociedade Brasileira de Computação - SBC

Cited by 28 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Explainable hate speech detection using LIME;International Journal of Speech Technology;2024-08-30

2. Abordagem Semi-Supervisionada para Anotação de Linguagem Tóxica;Anais do XIII Brazilian Workshop on Social Network Analysis and Mining (BraSNAM 2024);2024-07-21

3. Automatic hate speech detection in audio using machine learning algorithms;International Journal of Speech Technology;2024-06

4. A survey on multi-lingual offensive language detection;PeerJ Computer Science;2024-03-29

5. Reality television and the promotion of problematic behavior among cast members: a case study content analysis through the lens of feminist and media framing theories;Feminist Media Studies;2024-02-23