Affiliation:
1. School of Computer Science and Engineering, Xi’an Technological University, Xi’an 710021, China
2. State and Provincial Joint Engineering Laboratory of Advanced Network, Monitoring and Control, Xi’an 710021, China
Abstract
Due to the limited semantic information extraction with small objects and difficulty in distinguishing similar targets, it brings great challenges to target detection in remote sensing scenarios, which results in poor detection performance. This paper proposes an improved YOLOv5 remote sensing image target detection algorithm, SEB-YOLO (SPD-Conv + ECSPP + Bi-FPN + YOLOv5). Firstly, the space-to-depth (SPD) layer followed by a non-strided convolution (Conv) layer module (SPD-Conv) was used to reconstruct the backbone network, which retained the global features and reduced the feature loss. Meanwhile, the pooling module with the attention mechanism of the final layer of the backbone network was designed to help the network better identify and locate the target. Furthermore, a bidirectional feature pyramid network (Bi-FPN) with bilinear interpolation upsampling was added to improve bidirectional cross-scale connection and weighted feature fusion. Finally, the decoupled head is introduced to enhance the model convergence and solve the contradiction between the classification task and the regression task. Experimental results on NWPU VHR-10 and RSOD datasets show that the mAP of the proposed algorithm reaches 93.5% and 93.9%respectively, which is 4.0% and 5.3% higher than that of the original YOLOv5l algorithm. The proposed algorithm achieves better detection results for complex remote sensing images.
Funder
Natural Science Basic Research Project of Shaanxi Provincial Department of Science and Technology
Reference50 articles.
1. Face recognition using Histograms of Oriented Gradients;Bueno;Pattern Recognit. Lett.,2011
2. Graph-based visual saliency;Harel;Adv. Neural Inf. Process. Syst.,2006
3. Remote sensing image matching based on adaptive binning SIFT descriptor;Sedaghat;IEEE Trans. Geosci. Remote Sens.,2015
4. Yan, B., Wang, D., Lu, H., and Yang, X. (2020, January 14–19). Cooling-Shrinking Attack: Blinding the Tracker with Imperceptible Noises. Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Seattle, WA, USA.
5. Ji, L., and Yu-Xiao, N. (2023, January 12–15). Method of Insulator Detection Based on Improved Faster R-CNN. Proceedings of the 2023 6th International Conference on Electronics Technology (ICET), Chengdu, China.
Cited by
5 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献