Authors:
Wang Bingbing, Zhang Fengxiang, Li Kaipeng, Shi Kuijie, Wang Lei, Liu Gang
Abstract
Small object detection has broad application prospects in image processing for unmanned aerial vehicles, autonomous driving and remote sensing. However, small objects are often aggregated or occluded, and their features are insufficiently extracted, which makes detection challenging. In this paper, we propose an improved algorithm for small object detection to address these issues. We construct a spatial pooling pyramid and multi-scale channel attention module (SPP-MSCAM), which uses the spatial pyramid to extract multi-scale spatial features and multi-scale channel attention to capture global and local semantic features. More importantly, we introduce into the neck structure a fusion of a shallower layer, which has higher resolution, with a deeper layer, which carries more semantic information, to improve sensitivity to small object features. Extensive experiments on the VisDrone2019 and NWPU VHR-10 datasets show that the proposed method significantly improves Precision, mAP and mAP50 compared with YOLOv5, while preserving considerable real-time performance. The improved network can effectively alleviate the difficulties of aggregation, occlusion and insufficient feature extraction in small object detection, which should help its potential applications in the future.
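The abstract gives no implementation details, so the following is only a weight-free toy sketch, in plain Python, of the three ideas it names: pooling one feature map at several window sizes, gating features with a mix of global and local context, and fusing a shallow high-resolution map with an upsampled deep map. All function names, the kernel sizes (5, 9, 13) and the identity "weights" are assumptions made for illustration. The authors' SPP-MSCAM presumably uses learned convolutions, which are omitted here.

```python
import math

def max_pool_same(fmap, k):
    """Stride-1 max pooling over a k x k window, clipped at the borders,
    so the output keeps the input's height and width."""
    r = k // 2
    h, w = len(fmap), len(fmap[0])
    return [[max(fmap[y][x]
                 for y in range(max(0, i - r), min(h, i + r + 1))
                 for x in range(max(0, j - r), min(w, j + r + 1)))
             for j in range(w)]
            for i in range(h)]

def spatial_pyramid(features, kernels=(5, 9, 13)):
    """Concatenate the input channels with their pooled copies, one set per
    kernel size (kernel sizes are an assumption, not taken from the abstract)."""
    out = list(features)
    for k in kernels:
        out.extend(max_pool_same(ch, k) for ch in features)
    return out

def sigmoid(v):
    return 1.0 / (1.0 + math.exp(-v))

def multi_scale_channel_attention(features):
    """Gate every value by sigmoid(global + local), where the global term is
    the channel average and the local term is the value itself.
    Learned projections are replaced by the identity in this toy version."""
    gated = []
    for ch in features:
        n = len(ch) * len(ch[0])
        global_ctx = sum(sum(row) for row in ch) / n  # global average pooling
        gated.append([[v * sigmoid(global_ctx + v) for v in row] for row in ch])
    return gated

def upsample2x(fmap):
    """Nearest-neighbour 2x upsampling of one H x W map."""
    return [[v for v in row for _ in range(2)] for row in fmap for _ in range(2)]

def fuse_shallow_deep(shallow, deep):
    """Channel-wise concatenation of a shallow (high-resolution) map with a
    2x-upsampled deep (low-resolution) map."""
    return list(shallow) + [upsample2x(ch) for ch in deep]

if __name__ == "__main__":
    # One 4 x 4 channel with a single bright pixel standing in for a small object.
    x = [[[0.0] * 4 for _ in range(4)]]
    x[0][1][2] = 1.0
    y = multi_scale_channel_attention(spatial_pyramid(x))
    print(len(y), len(y[0]), len(y[0][0]))  # channels, height, width
```

Because the pooling uses stride 1, every pyramid level has the same spatial size as the input, which is what allows the channel-wise concatenation. The fusion step likewise relies on upsampling the deep map to the shallow map's resolution before concatenating.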
Subject
Artificial Intelligence, Computer Vision and Pattern Recognition, Theoretical Computer Science
Cited by
1 article.
1. Methods of Analyzing Random Point Structures in Solving Some Applied Engineering Problems;2024 International Conference on Industrial Engineering, Applications and Manufacturing (ICIEAM);2024-05-20