
On Obstructing Obscenity Obfuscation

Published:2017-05-12 Issue:2 Volume:11 Page:1-24
ISSN:1559-1131
Container-title:ACM Transactions on the Web
language:en
Short-container-title:ACM Trans. Web

Author:

Rojas-Galeano Sergio¹

Affiliation:

1. Universidad Distrital FJC, Bogotá, Colombia

Abstract

Obscenity (the use of rude words or offensive expressions) has spread from informal verbal conversations to digital media, becoming increasingly common in user-generated comments found in Web forums, newspaper user boards, social networks, blogs, and media-sharing sites. The basic obscenity-blocking mechanism is based on verbatim comparisons against a blacklist of banned vocabulary; however, creative users circumvent these filters by obfuscating obscenity with symbol substitutions or bogus segmentations that still visually preserve the original semantics, such as writing shit as $h¡;t or s.h.i.t or, even worse, mixing them as $.h….¡.t. The number of potential obfuscated variants is combinatorial, rendering the verbatim filter impractical. Here we describe a method intended to obstruct this anomaly, inspired by sequence alignment algorithms used in genomics and coupled with a tailor-made edit penalty function. The method only requires setting up the vocabulary of plain obscenities; no further training is needed. Its complexity when screening a single obscenity is linear, both in runtime and memory, in the length of the user-generated text. We validated the method in three different experiments. The first one involves a new dataset that is also introduced in this article; it consists of a set of manually annotated real-life comments in Spanish, gathered from the news user boards of an online newspaper, containing this type of obfuscation. The second one is a publicly available dataset of comments in Portuguese from a sports Web site. In these experiments, at the obscenity level, we observed recall rates greater than 90%, whereas precision rates varied between 75% and 95%, depending on the sequence length of the obscenity (shorter lengths yielded a higher number of false alarms). At the comment level, on the other hand, we report recall of 86%, precision of 91%, and specificity of 98%. The last experiment revealed that the method is more effective in matching this type of obfuscation than the classical Levenshtein edit distance. We conclude by discussing the prospects of the method to help enforce moderation rules on obscene expressions or to serve as a preprocessing mechanism for sequence cleaning and/or feature extraction in more sophisticated text categorization techniques.
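The general idea in the abstract (aligning a plain obscenity against the comment text under an edit penalty that is lenient toward look-alike symbols and inserted punctuation) can be illustrated with a minimal sketch. This is not the article's penalty function or implementation: the look-alike table, the two cost constants, the threshold, and all function names below are assumptions chosen for illustration. The sketch uses a standard approximate substring matching recurrence (the match may start anywhere in the text) and keeps only one column of the alignment table at a time, so for a fixed pattern its runtime grows linearly with the text length.

```python
# Hypothetical look-alike table; which symbols a real filter needs is an assumption.
LOOKALIKES = {"s": "$5", "h": "#", "i": "¡!1|", "t": "7+",
              "a": "@4", "e": "3", "o": "0"}

FILLER_COST = 0.1  # assumed penalty for punctuation wedged between letters
EDIT_COST = 1.0    # assumed penalty for any other insertion, deletion, or substitution


def substitution_cost(pattern_char, text_char):
    """Free if the text character equals the pattern letter or looks like it."""
    text_char = text_char.lower()
    if text_char == pattern_char or text_char in LOOKALIKES.get(pattern_char, ""):
        return 0.0
    return EDIT_COST


def insertion_cost(text_char):
    """Extra text characters are cheap only when they are non-space punctuation."""
    if text_char.isalnum() or text_char.isspace():
        return EDIT_COST
    return FILLER_COST


def best_match_cost(pattern, text):
    """Lowest alignment cost of a lowercase `pattern` against any substring of `text`."""
    m = len(pattern)
    # prev[i] = cost of aligning the first i pattern letters with an empty text span.
    prev = [i * EDIT_COST for i in range(m + 1)]
    best = prev[m]
    for c in text:
        col = [0.0] * (m + 1)  # col[0] = 0: a match may start at any text position
        for i in range(1, m + 1):
            col[i] = min(
                prev[i - 1] + substitution_cost(pattern[i - 1], c),  # align letter with c
                prev[i] + insertion_cost(c),                         # c is extra text
                col[i - 1] + EDIT_COST,                              # pattern letter missing
            )
        best = min(best, col[m])
        prev = col
    return best


def is_obfuscated(pattern, text, threshold=0.5):
    """Flag the text when some substring aligns with the pattern below the threshold."""
    return best_match_cost(pattern, text) <= threshold


if __name__ == "__main__":
    for sample in ["$h¡;t", "s.h.i.t", "$.h….¡.t", "what a nice shirt"]:
        print(sample, is_obfuscated("shit", sample))
```

Under these assumed costs, the three obfuscated variants quoted in the abstract fall below the threshold, while a verbatim comparison would miss all of them; a word such as "shirt" needs one full-cost edit and is not flagged.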

Publisher

Association for Computing Machinery (ACM)

Subject

Computer Networks and Communications

Link

https://dl.acm.org/doi/pdf/10.1145/3032963

References: 60 articles.

1. Youssef Bassil and Paul Semaan. 2012. ASR context-sensitive error correction based on Microsoft n-gram dataset. arXiv:1203.5262.

2. Approximate regular expression matching with multi-strings

3. Cyber Hate Speech on Twitter: An Application of Machine Classification and Statistical Modeling for Policy and Decision Making

4. Us and them: identifying cyber hate on Twitter across multiple protected characteristics

5. Knowledge-Based Approaches to Concept-Level Sentiment Analysis

Cited by 14 articles.

1. Leveraging Large Language Models (LLMs) to Support Collaborative Human-AI Online Risk Data Annotation;SSRN Electronic Journal;2024

2. Abusive Content Detection on Social Networks Using Machine Learning;Lecture Notes in Electrical Engineering;2023

3. OCR post-correction for detecting adversarial text images;Journal of Information Security and Applications;2022-05

4. Investigating the role of swear words in abusive language detection tasks;Language Resources and Evaluation;2022-02-17

5. (Semi-)Automatische Kommentarmoderation zur Erhaltung Konstruktiver Diskurse;Aktivismus- und Propagandaforschung;2022