Res-SwinTransformer with Local Contrast Attention for Infrared Small Target Detection

Author:

Zhao Tianhua1,Cao Jie12,Hao Qun123,Bao Chun1ORCID,Shi Moudan1

Affiliation:

1. School of Optics and Photonics, Beijing Institute of Technology, Beijing 100081, China

2. Yangtze Delta Region Academy, Beijing Institute of Technology, Jiaxing 314003, China

3. School of Opto-Electronic Engineering, Changchun University of Science and Technology, Changchun 130022, China

Abstract

Infrared small target detection for aerial remote sensing is crucial in both civil and military fields. For infrared targets with small sizes, low signal-to-noise ratio, and little detailed texture information, we propose a Res-SwinTransformer with a Local Contrast Attention Network (RSLCANet). Specifically, we first design a SwinTransformer-based backbone to improve the interaction capability of global information. On this basis, we introduce a residual structure to fully retain the shallow detail information of small infrared targets. Furthermore, we design a plug-and-play attention module named LCA Block (local contrast attention block) to enhance the target and suppress the background, which is based on local contrast calculation. In addition, we develop an air-to-ground multi-scene infrared vehicle dataset based on an unmanned aerial vehicle (UAV) platform, which can provide a database for infrared vehicle target detection algorithm testing and infrared target characterization studies. Experiments demonstrate that our method can achieve a low-miss detection rate, high detection accuracy, and high detection speed. In particular, on the DroneVehicle dataset, our designed RSLCANet increases by 4.3% in terms of mAP@0.5 compared to the base network You Only Look Once (YOLOX). In addition, our network has fewer parameters than the two-stage network and the Transformer-based network model, which helps the practical deployment and can be applied in fields such as car navigation, crop monitoring, and infrared warning.

Funder

Beijing Nature Science Foundation of China

Publisher

MDPI AG

Subject

General Earth and Planetary Sciences

Reference48 articles.

1. A locally optimized model for hyperspectral and multispectral images fusion;Ren;IEEE Trans. Geosci. Remote Sens.,2021

2. Generalized linear spectral mixing model for spatial–temporal–spectral fusion;Zhou;IEEE Trans. Geosci. Remote Sens.,2022

3. MLR-DBPFN: A multi-scale low rank deep back projection fusion network for anti-noise hyperspectral and multispectral image fusion;Sun;IEEE Trans. Geosci. Remote Sens.,2022

4. Marine floating raft aquaculture extraction of hyperspectral remote sensing images based decision tree algorithm;Hou;Int. J. Appl. Earth Obs. Geoinf.,2022

5. A simple and effective spectral-spatial method for mapping large-scale coastal wetlands using China ZY1-02D satellite hyperspectral images;Sun;Int. J. Appl. Earth Obs. Geoinf.,2021

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3