Multi‐Scale Feature Attention‐DEtection TRansformer: Multi‐Scale Feature Attention for security check object detection

Author:

Sima Haifeng12,Chen Bailiang1ORCID,Tang Chaosheng1ORCID,Zhang Yudong13ORCID,Sun Junding1

Affiliation:

1. School of Computer Science and Technology Henan Polytechnic University Jiaozuo China

2. Institute of Quantitative Remote Sensing and Smart Agriculture Henan Polytechnic University Jiaozuo China

3. School of Computing and Mathematical Sciences University of Leicester Leicester UK

Abstract

AbstractX‐ray security checks aim to detect contraband in luggage; however, the detection accuracy is hindered by the overlapping and significant size differences of objects in X‐ray images. To address these challenges, the authors introduce a novel network model named Multi‐Scale Feature Attention (MSFA)‐DEtection TRansformer (DETR). Firstly, the pyramid feature extraction structure is embedded into the self‐attention module, referred to as the MSFA. Leveraging the MSFA module, MSFA‐DETR extracts multi‐scale feature information and amalgamates them into high‐level semantic features. Subsequently, these features are synergised through attention mechanisms to capture correlations between global information and multi‐scale features. MSFA significantly bolsters the model's robustness across different sizes, thereby enhancing detection accuracy. Simultaneously, A new initialisation method for object queries is proposed. The authors’ foreground sequence extraction (FSE) module extracts key feature sequences from feature maps, serving as prior knowledge for object queries. FSE expedites the convergence of the DETR model and elevates detection accuracy. Extensive experimentation validates that this proposed model surpasses state‐of‐the‐art methods on the CLCXray and PIDray datasets.

Funder

Science and Technology Department of Henan Province

National Natural Science Foundation of China

Publisher

Institution of Engineering and Technology (IET)

Subject

Computer Vision and Pattern Recognition,Software

Reference54 articles.

1. Object Detection in 20 Years: A Survey

2. End-to-End Object Detection with Transformers

3. Wang Y. et al.:Anchor DETR: query design for transformer‐based object detection. In: Proceedings of the AAAI Conference on Artificial Intelligence  vol.36 pp. 2567–2575 (2022)

4. Liu S. et al.:DAB‐DETR: dynamic anchor boxes are better queries for DETR. In: International Conference on Learning Representations (2022).https://openreview.net/forum?id=oMI9PjOb9Jl

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3