Anatomy of Online Hate: Developing a Taxonomy and Machine Learning Models for Identifying and Classifying Hate in Online News Media-Reference-Cited by-同舟云学术

Anatomy of Online Hate: Developing a Taxonomy and Machine Learning Models for Identifying and Classifying Hate in Online News Media

Published:2018-06-15 Issue:1 Volume:12 Page:
ISSN:2334-0770
Container-title:Proceedings of the International AAAI Conference on Web and Social Media
language:
Short-container-title:ICWSM

Author:

Salminen Joni,Almerekhi Hind,Milenković Milica,Jung Soon-gyo,An Jisun,Kwak Haewoon,Jansen Bernard

Abstract

Online social media platforms generally attempt to mitigate hateful expressions, as these comments can be detrimental to the health of the community. However, automatically identifying hateful comments can be challenging. We manually label 5,143 hateful expressions posted to YouTube and Facebook videos among a dataset of 137,098 comments from an online news media. We then create a granular taxonomy of different types and targets of online hate and train machine learning models to automatically detect and classify the hateful comments in the full dataset. Our contribution is twofold: 1) creating a granular taxonomy for hateful online comments that includes both types and targets of hateful comments, and 2) experimenting with machine learning, including Logistic Regression, Decision Tree, Random Forest, Adaboost, and Linear SVM, to generate a multiclass, multilabel classification model that automatically detects and categorizes hateful comments in the context of online news media. We find that the best performing model is Linear SVM, with an average F1 score of 0.79 using TF-IDF features. We validate the model by testing its predictive ability, and, relatedly, provide insights on distinct types of hate speech taking place on social media.

Publisher

Association for the Advancement of Artificial Intelligence (AAAI)

Cited by 40 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. A single modality apparent first impression personality recognition model with temporal emotion based LSTM;Expert Systems with Applications;2025-01

2. User-Centric Modeling of Online Hate Through the Lens of Psycholinguistic Patterns and Behaviors in Social Media;IEEE Transactions on Computational Social Systems;2024-06

3. Where do cross-cutting discussions happen?: Identifying cross-cutting comments on YouTube videos of political vloggers and mainstream news outlets;PLOS ONE;2024-05-29

4. Safeguarding human values: rethinking US law for generative AI’s societal impacts;AI and Ethics;2024-05-07

5. Misinformation as a Harm: Structured Approaches for Fact-Checking Prioritization;Proceedings of the ACM on Human-Computer Interaction;2024-04-17