Affiliation:
1. School of Optics and Photonics, Beijing Institute of Technology, Beijing 100081, China
2. Yangtze Delta Region Academy, Beijing Institute of Technology, Jiaxing 314003, China
3. School of Opto-Electronic Engineering, Changchun University of Science and Technology, Changchun 130022, China
Abstract
Infrared small target detection for aerial remote sensing is crucial in both civil and military fields. For infrared targets with small sizes, low signal-to-noise ratio, and little detailed texture information, we propose a Res-SwinTransformer with a Local Contrast Attention Network (RSLCANet). Specifically, we first design a SwinTransformer-based backbone to improve the interaction capability of global information. On this basis, we introduce a residual structure to fully retain the shallow detail information of small infrared targets. Furthermore, we design a plug-and-play attention module named LCA Block (local contrast attention block) to enhance the target and suppress the background, which is based on local contrast calculation. In addition, we develop an air-to-ground multi-scene infrared vehicle dataset based on an unmanned aerial vehicle (UAV) platform, which can provide a database for infrared vehicle target detection algorithm testing and infrared target characterization studies. Experiments demonstrate that our method can achieve a low-miss detection rate, high detection accuracy, and high detection speed. In particular, on the DroneVehicle dataset, our designed RSLCANet increases by 4.3% in terms of mAP@0.5 compared to the base network You Only Look Once (YOLOX). In addition, our network has fewer parameters than the two-stage network and the Transformer-based network model, which helps the practical deployment and can be applied in fields such as car navigation, crop monitoring, and infrared warning.
Funder
Beijing Nature Science Foundation of China
Subject
General Earth and Planetary Sciences
Reference48 articles.
1. A locally optimized model for hyperspectral and multispectral images fusion;Ren;IEEE Trans. Geosci. Remote Sens.,2021
2. Generalized linear spectral mixing model for spatial–temporal–spectral fusion;Zhou;IEEE Trans. Geosci. Remote Sens.,2022
3. MLR-DBPFN: A multi-scale low rank deep back projection fusion network for anti-noise hyperspectral and multispectral image fusion;Sun;IEEE Trans. Geosci. Remote Sens.,2022
4. Marine floating raft aquaculture extraction of hyperspectral remote sensing images based decision tree algorithm;Hou;Int. J. Appl. Earth Obs. Geoinf.,2022
5. A simple and effective spectral-spatial method for mapping large-scale coastal wetlands using China ZY1-02D satellite hyperspectral images;Sun;Int. J. Appl. Earth Obs. Geoinf.,2021
Cited by
4 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献