Affiliation:
1. School of Computer Science, Hunan University of Technology, Zhuzhou 412007, China
Abstract
With the significant advancements in deep learning technology, the domain of remote sensing image processing has witnessed a surge in attention, particularly in the field of object detection. The detection of targets in remotely sensed images is a challenging task, primarily due to the abundance of small-sized targets and their multi-scale distribution. These challenges often result in inaccurate object detection, leading to both missed detections and false positives. To overcome these issues, this paper presents a novel algorithm called YOLO-GCRS. This algorithm builds upon the original YOLOv5s algorithm by enhancing the feature capture capability of the backbone network. This enhancement is achieved by integrating a new module, the Global Context Block (GC-C3), with the C3 backbone network. Additionally, the algorithm incorporates a convoluted block known as CBM (Convolution + BatchNormalization + Mish) to enhance the network model’s capability of extracting depth features. Moreover, a detection head, ECAHead, is proposed, which integrates an efficient attention channel (ECA) for extracting high-dimensional features from images. It achieves higher precision, recall, and mAP@0.5 values (98.3%, 94.7%, and 97.7%, respectively) on the publicly available RSOD dataset compared to the original YOLOv5s algorithm (improving by 5.3%, 0.8%, and 2.7%, respectively). Furthermore, when compared to mainstream detection algorithms like YOLOv7-tiny and YOLOv8s, the proposed algorithm exhibits improvements of 2.0% and 7.5%, respectively, in mAP@0.5. These results provide validation for the effectiveness of our YOLO-GCRS algorithm in addressing the challenges of missed and false detections in remote sensing object detection.
Funder
National Key Research and Development Program (NKRDP) projects
Hunan Provincial Self-Science Foundation
Natural Science Foundation of Hunan Province
Open Platform Innovation Foundation of the Education Department of Hunan
Subject
Electrical and Electronic Engineering,Computer Networks and Communications,Hardware and Architecture,Signal Processing,Control and Systems Engineering
Reference32 articles.
1. Surface-to-air missile sites detection agent with remote sensing images;Liu;Sci. China Inf. Sci.,2021
2. Zhang, Y., Ning, G., Chen, S., and Yang, Y. (2021). Impact of rapid urban sprawl on the local meteorological observational environment based on remote sensing images and GIS technology. Remote Sens., 13.
3. Aircraft target detection in remote sensing images based on improved YOLOv5;Luo;IEEE Access,2022
4. Dalal, N., and Triggs, B. (2005, January 20–25). Histograms of oriented gradients for human detection. Proceedings of the 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR’05), San Diego, CA, USA.
5. Felzenszwalb, P.F., Girshick, R.B., and Mcallester, D.A. (2010, January 13–18). Cascade object detection with deformable part models. Proceedings of the 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, San Francisco, CA, USA.
Cited by
1 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献