ASYv3: Attention‐enabled pooling embedded Swin transformer‐based YOLOv3 for obscenity detection

Published:2023-05-16 Issue:8 Volume:40 Page:
ISSN:0266-4720
Container-title:Expert Systems
language:en
Short-container-title:Expert Systems

Author:

Samal Sonali¹, Zhang Yu‐Dong², Gadekallu Thippa Reddy³⁴⁵⁶⁷, Balabantaray Bunil Kumar¹

Affiliation:

1. Department of Computer Science and Engineering, National Institute of Technology Meghalaya, Shillong, India

2. School of Computer Science, University of Leicester, Leicester, UK

3. Zhongda Group, Haiyan County, Jiaxing, China

4. Department of Electrical and Computer Engineering, Lebanese American University, Byblos, Lebanon

5. School of Information Technology and Engineering, Vellore Institute of Technology, Vellore, India

6. College of Information Science and Engineering, Jiaxing University, Jiaxing, China

7. Division of Research and Development, Lovely Professional University, Phagwara, India

Abstract

The rampant spread of explicit content across social media can leave a damaging mark on our society. Hence, the need to be vigilant in detecting and curtailing sexually explicit content cannot be overstated. As such, it becomes paramount to discern and manage sexually explicit material to curb its dissemination and safeguard our digital communities from its harmful effects. In this article, we propose a technique entitled attention‐enabled pooling (ABP) embedded Swin transformer‐based YOLOv3 (ASYv3) for detecting obscene areas in images and drawing a bounding box around the offensive regions. ASYv3 employs a two‐step approach for enhanced performance in obscenity detection. In the first step, a scalable and efficient Swin transformer block is integrated, utilizing self‐attention and model parallelism to train massive models effectively. In the second step, the embedding layer of the Swin transformer is replaced with ABP, mitigating disruption of feature context. ABP allows for the projection of raw‐valued features into linear form with proper attention to feature context information at specified locations, resulting in optimized feature extraction. The proposed ASYv3 model was trained on the annotated obscene images (AOI) dataset. It surpassed the state‐of‐the‐art methods, achieving 97% testing accuracy, 96.62% precision, 97.40% sensitivity, a 3.48% false positive rate (FPR), a 97.37% negative predictive value (NPV), and 95.59% mean average precision (mAP).
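
For readers unfamiliar with the reported metrics, the sketch below shows how accuracy, precision, sensitivity, FPR, and NPV are conventionally derived from binary confusion-matrix counts. It is an illustration of the standard definitions only, not the authors' evaluation code; the function name `detection_metrics` and the counts passed to it are hypothetical and are not taken from the paper. mAP is omitted because it depends on per-box confidence scores and IoU thresholds that the abstract does not specify.

```python
def detection_metrics(tp: int, fp: int, tn: int, fn: int) -> dict:
    """Standard binary metrics from confusion-matrix counts.

    Assumes every denominator is non-zero (at least one positive and
    one negative example, and at least one prediction of each class).
    """
    return {
        "accuracy": (tp + tn) / (tp + fp + tn + fn),
        "precision": tp / (tp + fp),       # share of flagged items that are truly obscene
        "sensitivity": tp / (tp + fn),     # recall: share of obscene items that were flagged
        "fpr": fp / (fp + tn),             # share of benign items wrongly flagged
        "npv": tn / (tn + fn),             # share of items passed as benign that are benign
    }


# Hypothetical counts for illustration only; not results from the paper.
metrics = detection_metrics(tp=90, fp=10, tn=85, fn=15)
for name, value in metrics.items():
    print(f"{name}: {value:.4f}")
```

Note that FPR is the only one of these metrics where lower is better, which is why the abstract's 3.48% sits alongside values near 97%.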

Publisher

Wiley

Subject

Artificial Intelligence, Computational Theory and Mathematics, Theoretical Computer Science, Control and Systems Engineering

Link

https://onlinelibrary.wiley.com/doi/pdf/10.1111/exsy.13337

References: 39 articles.

1. Transfer Detection of YOLO to Focus CNN’s Attention on Nude Regions for Adult Content Detection

2. Avila S. & Araújo A. D. A. (2018). NPDI porn dataset. The Institute of Computing at UNICAMP.

3. Pooling in image representation: The visual codeword point of view

4. Explicit image detection using YCbCr space color model as skin detection;Basilio J. A. M.;Applications of Mathematics and Computer Engineering,2011

5. Chai D. & Bouzerdoum A. (2000). A Bayesian approach to skin color classification in YCbCr color space. 2000 TENCON Proceedings. Intelligent Systems and Technologies for the New Millennium (Cat. No. 00CH37119), vol. 2, IEEE, pp. 421–424.

Cited by 3 articles.

1. Computer vision classification detection of chicken parts based on optimized Swin-Transformer;CyTA - Journal of Food;2024-05-08

2. LSiF: Log-Gabor Empowered Siamese Federated Learning for Efficient Obscene Image Classification in the Era of Industry 5.0;Communications in Computer and Information Science;2023-11-27

3. A Comprehensive Survey of Machine Learning Methods for Surveillance Videos Anomaly Detection;IEEE Access;2023