A Study of Multilingual Toxic Text Detection Approaches under Imbalanced Sample Distribution-Reference-Cited by-同舟云学术

A Study of Multilingual Toxic Text Detection Approaches under Imbalanced Sample Distribution

Published:2021-05-12 Issue:5 Volume:12 Page:205
ISSN:2078-2489
Container-title:Information
language:en
Short-container-title:Information

Author:

Song Guizhe^ORCID,Huang Degen,Xiao Zhifeng

Abstract

Multilingual characteristics, lack of annotated data, and imbalanced sample distribution are the three main challenges for toxic comment analysis in a multilingual setting. This paper proposes a multilingual toxic text classifier which adopts a novel fusion strategy that combines different loss functions and multiple pre-training models. Specifically, the proposed learning pipeline starts with a series of pre-processing steps, including translation, word segmentation, purification, text digitization, and vectorization, to convert word tokens to a vectorized form suitable for the downstream tasks. Two models, multilingual bidirectional encoder representation from transformers (MBERT) and XLM-RoBERTa (XLM-R), are employed for pre-training through Masking Language Modeling (MLM) and Translation Language Modeling (TLM), which incorporate semantic and contextual information into the models. We train six base models and fuse them to obtain three fusion models using the F1 scores as the weights. The models are evaluated on the Jigsaw Multilingual Toxic Comment dataset. Experimental results show that the best fusion model outperforms the two state-of-the-art models, MBERT and XLM-R, in F1 score by 5.05% and 0.76%, respectively, verifying the effectiveness and robustness of the proposed fusion strategy.

Publisher

MDPI AG

Subject

Information Systems

Link

https://www.mdpi.com/2078-2489/12/5/205/pdf

Reference57 articles.

1. Challenges for toxic comment classification: An in-depth error analysis;van Aken;arXiv,2018

2. QutNocturnal@ HASOC’19: CNN for hate speech and offensive content identification in Hindi language;Bashar;arXiv,2020

3. BEEP! Korean Corpus of Online News Comments for Toxic Speech Detection;Moon;arXiv,2020

4. Reducing Unintended Identity Bias in Russian Hate Speech Detection;Zueva;arXiv,2020

5. Comparing pre-trained language models for Spanish hate speech detection

Cited by 10 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. A Systematic Review of Toxicity in Large Language Models: Definitions, Datasets, Detectors, Detoxification Methods and Challenges;2024-07-15

2. Technical Challenges to Automated Detection of Toxic Language;Algorithms for Intelligent Systems;2024

3. Evaluating the Effectiveness of Capsule Neural Network in Toxic Comment Classification Using Pre-Trained BERT Embeddings;TENCON 2023 - 2023 IEEE Region 10 Conference (TENCON);2023-10-31

4. XRBi-GAC: A hybrid deep learning framework for multilingual toxicity detection;Journal of Intelligent & Fuzzy Systems;2023-07-02

5. Leveraging Readability and Sentiment in Spam Review Filtering Using Transformer Models;Computer Systems Science and Engineering;2023