Affiliation:
1. School of Software, Xinjiang University, Urumqi 830091, China
2. College of Information Science and Engineering, Xinjiang University, Urumqi 830046, China
Abstract
The most significant technical challenges in current aerial image object detection are the extremely low accuracy for small objects that are densely distributed within a scene and the lack of semantic information. Moreover, existing detectors with large parameter counts are unsuitable for aerial image object-detection scenarios that target low-end GPUs. To address these challenges, we propose efficient-lightweight You Only Look Once (EL-YOLO), a model that overcomes the limitations of existing detectors while remaining suitable for low-end GPUs. EL-YOLO improves on the baseline models in three key areas. Firstly, we design and compare three model architectures to strengthen the model's focus on small objects and identify the most effective network structure. Secondly, we design efficient spatial pyramid pooling (ESPP) to enhance the representation of small-object features in aerial images. Lastly, we introduce the alpha-complete intersection over union (α-CIoU) loss function to tackle the imbalance between positive and negative samples in aerial images. The proposed EL-YOLO method demonstrates strong generalization and robustness on the small-object detection problem in aerial images. The experimental results show that, with the model parameters kept below 10 M and the input image size fixed at 640 × 640 pixels, the APS of EL-YOLOv5 reached 10.8% and 10.7% on two challenging aerial image datasets, DIOR and VisDrone, respectively, improving on YOLOv5 by 1.9% and 2.2%.
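The abstract names the α-CIoU loss but does not state its formula. As a rough illustration only, the sketch below follows the commonly cited alpha-IoU formulation, in which each term of the CIoU loss is raised to a power α: 1 − IoU^α + (ρ²/c²)^α + (βv)^α. The function name `alpha_ciou_loss`, the `(x1, y1, x2, y2)` box layout, and the default `alpha=3.0` are assumptions made for this example, not details taken from the paper.

```python
import math


def alpha_ciou_loss(box, gt, alpha=3.0, eps=1e-9):
    """Sketch of an alpha-CIoU loss for two axis-aligned boxes.

    box, gt: (x1, y1, x2, y2) tuples (assumed layout).
    alpha:   power applied to each term; 3.0 is a placeholder default,
             not a value reported in the abstract.
    """
    # Widths and heights of the predicted and ground-truth boxes.
    w, h = box[2] - box[0], box[3] - box[1]
    wg, hg = gt[2] - gt[0], gt[3] - gt[1]

    # Intersection area (zero when the boxes do not overlap).
    iw = max(0.0, min(box[2], gt[2]) - max(box[0], gt[0]))
    ih = max(0.0, min(box[3], gt[3]) - max(box[1], gt[1]))
    inter = iw * ih

    # IoU, with eps guarding against division by zero.
    union = w * h + wg * hg - inter + eps
    iou = inter / union

    # Squared diagonal of the smallest box enclosing both boxes.
    cw = max(box[2], gt[2]) - min(box[0], gt[0])
    ch = max(box[3], gt[3]) - min(box[1], gt[1])
    c2 = cw ** 2 + ch ** 2 + eps

    # Squared distance between the two box centres.
    dx = (box[0] + box[2]) - (gt[0] + gt[2])
    dy = (box[1] + box[3]) - (gt[1] + gt[3])
    rho2 = (dx ** 2 + dy ** 2) / 4.0

    # Aspect-ratio consistency term v and its trade-off weight beta.
    v = (4.0 / math.pi ** 2) * (
        math.atan(wg / (hg + eps)) - math.atan(w / (h + eps))
    ) ** 2
    beta = v / ((1.0 - iou) + v + eps)

    # Each CIoU term raised to the power alpha.
    return 1.0 - iou ** alpha + (rho2 / c2) ** alpha + (beta * v) ** alpha
```

With `alpha=1.0` the expression reduces to the ordinary CIoU loss, which makes it easy to compare the two on the same pair of boxes.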
Funder
National Natural Science Foundation of China
Key R&D projects in the Xinjiang Uygur Autonomous Region
Natural Science Foundation of the Xinjiang Uygur Autonomous Region of China
Xinjiang University doctoral postgraduate innovation project
Subject
Electrical and Electronic Engineering; Biochemistry; Instrumentation; Atomic and Molecular Physics, and Optics; Analytical Chemistry
Cited by
18 articles.