Affiliation:
1. Unmanned System Research Institute, Northwestern Polytechnical University, Xi’an 710129, China
2. Wuhan National Laboratory for Optoelectronics, Huazhong Institute of Electro-Optics, Wuhan 430073, China
Abstract
Small-scale low-altitude unmanned aerial vehicles (UAVs) equipped with perception capability for military targets will become increasingly essential for strategic reconnaissance and stationary patrols in the future. To respond to challenges such as complex terrain and weather variations, as well as the deception and camouflage of military targets, this paper proposes a hybrid detection model that combines Convolutional Neural Network (CNN) and Transformer architecture in a decoupled manner. The proposed detector consists of the C-branch and the T-branch. In the C-branch, Multi-gradient Path Network (MgpNet) is introduced, inspired by the multi-gradient flow strategy, excelling in capturing the local feature information of an image. In the T-branch, RPFormer, a Region–Pixel two-stage attention mechanism, is proposed to aggregate the global feature information of the whole image. A feature fusion strategy is proposed to merge the feature layers of the two branches, further improving the detection accuracy. Furthermore, to better simulate real UAVs’ reconnaissance environments, we construct a dataset of military targets in complex environments captured from an oblique perspective to evaluate the proposed detector. In ablation experiments, different fusion methods are validated, and the results demonstrate the effectiveness of the proposed fusion strategy. In comparative experiments, the proposed detector outperforms most advanced general detectors.
Funder
National Natural Science Foundation of China
Reference60 articles.
1. Battlefield image situational awareness application based on deep learning;Peng;IEEE Intell. Syst.,2019
2. Yang, K., Pan, A., Yang, Y., Zhang, S., Ong, S.H., and Tang, H. (2017). Remote sensing image registration using multiple image features. Remote Sens., 9.
3. Intelligent UAV swarm cooperation for multiple targets tracking;Zhou;IEEE Internet Things J.,2021
4. A Novel scene matching navigation system for UAVs based on vision/inertial fusion;Mei;IEEE Sens. J.,2023
5. Computer vision for behaviour-based safety in construction: A review and future directions;Fang;Adv. Eng. Inform.,2020