Automated Hate Speech Detection and the Problem of Offensive Language-Reference-Cited by-同舟云学术

Automated Hate Speech Detection and the Problem of Offensive Language

Published:2017-05-03 Issue:1 Volume:11 Page:512-515
ISSN:2334-0770
Container-title:Proceedings of the International AAAI Conference on Web and Social Media
language:
Short-container-title:ICWSM

Author:

Davidson Thomas,Warmsley Dana,Macy Michael,Weber Ingmar

Abstract

A key challenge for automatic hate-speech detection on social media is the separation of hate speech from other instances of offensive language. Lexical detection methods tend to have low precision because they classify all messages containing particular terms as hate speech and previous work using supervised learning has failed to distinguish between the two categories. We used a crowd-sourced hate speech lexicon to collect tweets containing hate speech keywords. We use crowd-sourcing to label a sample of these tweets into three categories: those containing hate speech, only offensive language, and those with neither. We train a multi-class classifier to distinguish between these different categories. Close analysis of the predictions and the errors shows when we can reliably separate hate speech from other offensive language and when this differentiation is more difficult. We find that racist and homophobic tweets are more likely to be classified as hate speech but that sexist tweets are generally classified as offensive. Tweets without explicit hate keywords are also more difficult to classify.

Publisher

Association for the Advancement of Artificial Intelligence (AAAI)

Cited by 635 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Exposing the Achilles’ heel of textual hate speech classifiers using indistinguishable adversarial examples;Expert Systems with Applications;2024-11

2. People who share encounters with racism are silenced online by humans and machines, but a guideline-reframing intervention holds promise;Proceedings of the National Academy of Sciences;2024-09-09

3. Context-aware and expert data resources for Brazilian Portuguese hate speech detection;Natural Language Processing;2024-09-06

4. A visual approach to tracking emotional sentiment dynamics in social network commentaries;Social Network Analysis and Mining;2024-09-05

5. Study on relationship between adversarial texts and language errors: a human-computer interaction perspective;Behaviour & Information Technology;2024-09-04