Few-Shot Air Object Detection Network
-
Published:2023-10-04
Issue:19
Volume:12
Page:4133
-
ISSN:2079-9292
-
Container-title:Electronics
-
language:en
-
Short-container-title:Electronics
Author:
Cai Wei1ORCID, Wang Xin1, Jiang Xinhao1, Yang Zhiyong1, Di Xingyu1, Gao Weijie1
Affiliation:
1. The Third Faculty of Xi’an Research Institute of High Technology, Xi’an 710064, China
Abstract
Focusing on the problem of low detection precision caused by the few-shot and multi-scale characteristics of air objects, we propose a few-shot air object detection network (FADNet). We first use a transformer as the backbone network of the model and then build a multi-scale attention mechanism (MAM) to deeply fuse the W- and H-dimension features extracted from the channel dimension and the local and global features extracted from the spatial dimension with the object features to improve the network’s performance when detecting air objects. Second, the neck network is innovated based on the path aggregation network (PANet), resulting in an improved path aggregation network (IPANet). Our proposed network reduces the information lost during feature transfer by introducing a jump connection, utilizes sparse connection convolution, strengthens feature extraction abilities at all scales, and improves the discriminative properties of air object features at all scales. Finally, we propose a multi-scale regional proposal network (MRPN) that can establish multiple RPNs based on the scale types of the output features, utilizing adaptive convolutions to effectively extract object features at each scale and enhancing the ability to process multi-scale information. The experimental results showed that our proposed method exhibits good performance and generalization, especially in the 1-, 2-, 3-, 5-, and 10-shot experiments, with average accuracies of 33.2%, 36.8%, 43.3%, 47.2%, and 60.4%, respectively. The FADNet solves the problems posed by the few-shot characteristics and multi-scale characteristics of air objects, as well as improving the detection capabilities of the air object detection model.
Funder
National Defense Science and Technology 173 Program Technical Field Fund Project
Subject
Electrical and Electronic Engineering,Computer Networks and Communications,Hardware and Architecture,Signal Processing,Control and Systems Engineering
Reference44 articles.
1. Demirel, B., Baran, O.B., and Cinbis, R.G. (2023, January 18–22). Meta-tuning Loss Functions and Data Augmentation for Few-shot Object Detection. Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, Vancouver, BC, Canada. 2. Yan, X., Chen, Z., Xu, A., Wang, X., Liang, X., and Lin, L. (November, January 27). Meta r-cnn: Towards general solver for instance-level low-shot learning. Proceedings of the IEEE/CVF International Conference on Computer Vision, Seoul, Republic of Korea. 3. Kang, B., Liu, Z., Wang, X., Yu, F., Feng, J., and Darrell, T. (November, January 27). Few-shot object detection via feature reweighting. Proceedings of the IEEE/CVF International Conference on Computer Vision, Seoul, Republic of Korea. 4. Gao, Z., Guo, S., Xu, C., Zhang, J., Gong, M., Del Ser, J., and Li, S. (2022). Multi-domain Adversarial Variational Bayesian Inference for Domain Generalization. IEEE Trans. Circuits Syst. Video Technol., 1. 5. Few-shot object detection via association and discrimination;Cao;Adv. Neural Inf. Process. Syst.,2021
|
|