ASYv3: Attention‐enabled pooling embedded Swin transformer‐based YOLOv3 for obscenity detection

Author:

Samal Sonali1ORCID,Zhang Yu‐Dong2ORCID,Gadekallu Thippa Reddy34567,Balabantaray Bunil Kumar1

Affiliation:

1. Department of Computer Science and Engineering National Institute of Technology Meghalaya Shillong India

2. School of Computer Science University of Leicester Leicester UK

3. Zhongda Group, Haiyan County Jiaxing China

4. Department of Electrical and Computer Engineering, Lebanese American University Byblos Lebanon

5. School of Information Technology and Engineering, Vellore Institute of Technology Vellore India

6. College of Information Science and Engineering, Jiaxing University Jiaxing China

7. Division of Research and development, Lovely Professional University Phagwara India

Abstract

AbstractThe rampant spread of explicit content across social media can leave a damaging mark on our society. Hence, the need to be vigilant in detecting and curtailing sexually explicit content cannot be overstated. As such, it becomes paramount to discern and manage sexually explicit material to curb its dissemination and safeguard our digital communities from its harmful effects. In this article, we propose a unique technique entitled attention‐enabled pooling (ABP) embedded Swin transformer‐based YOLOv3 (ASYv3) for the detection of obscene areas present in the images with a bounding box around the offensive regions. ASYv3 employs a unique two‐step approach for enhanced performance in obscene detection. In the first step, a scalable and efficient Swin transformer block is integrated, utilizing self‐attention and model parallelism to train massive models effectively. In the second phase, the embedding layer of the Swin transformer is replaced with ABP, mitigating disruption of feature context. ABP allows for the projection of raw‐valued features into linear form with proper attention to feature context information at specified locations, resulting in optimized feature extraction. The proposed ABP embedded Swin transformer‐based YOLOv3 (ASYv3) was trained with annotated obscene images (AOI) dataset. The proposed ASYv3 model surpassed the state‐of‐the‐art methods by achieving 97% testing accuracy, 96.62% precision, 97.40% sensitivity, 3.48% FPR rate, 97.37% NPV values, and 95.59% mAP values, respectively.

Publisher

Wiley

Subject

Artificial Intelligence,Computational Theory and Mathematics,Theoretical Computer Science,Control and Systems Engineering

Reference39 articles.

Cited by 3 articles. 订阅此论文施引文献 订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3