Affiliation:
1. College of Artificial Intelligence and Automation, Hohai University, Changzhou 213200, China
2. School of Mechanical and Electrical Engineering, Hohai University, Changzhou 213200, China
Abstract
Small object detection for unmanned aerial vehicle (UAV) image scenarios is a challenging task in the computer vision field. Some problems should be further studied, such as the dense small objects and background noise in high-altitude aerial photography images. To address these issues, an enhanced YOLOv8s-based model for detecting small objects is presented. The proposed model incorporates a parallel multi-scale feature extraction module (PMSE), which enhances the feature extraction capability for small objects by generating adaptive weights with different receptive fields through parallel dilated convolution and deformable convolution, and integrating the generated weight information into shallow feature maps. Then, a scale compensation feature pyramid network (SCFPN) is designed to integrate the spatial feature information derived from the shallow neural network layers with the semantic data extracted from the higher layers of the network, thereby enhancing the network’s capacity for representing features. Furthermore, the largest-object detection layer is removed from the original detection layers, and an ultra-small-object detection layer is applied, with the objective of improving the network’s detection performance for small objects. Finally, the WIOU loss function is employed to balance high- and low-quality samples in the dataset. The results of the experiments conducted on the two public datasets illustrate that the proposed model can enhance the object detection accuracy in UAV image scenarios.
Funder
National Natural Science Foundation of China
Jiangsu Province Key R&D Program
Reference65 articles.
1. Tang, G., Ni, J., Zhao, Y., Gu, Y., and Cao, W. (2024). A Survey of Object Detection for UAVs Based on Deep Learning. Remote Sens., 16.
2. Vehicle Detection From UAV Imagery With Deep Learning: A Review;Bouguettaya;IEEE Trans. Neural Networks Learn. Syst.,2022
3. Deep learning-based scene understanding for autonomous robots: A survey;Ni;Intell. Robot.,2023
4. BASNet: Burned Area Segmentation Network for Real-Time Detection of Damage Maps in Remote Sensing Images;Bo;IEEE Trans. Geosci. Remote Sens.,2022
5. Zhu, J., Yang, G., Feng, X., Li, X., Fang, H., Zhang, J., Bai, X., Tao, M., and He, Y. (2022). Detecting Wheat Heads from UAV Low-Altitude Remote Sensing Images Using Deep Learning Based on Transformer. Remote Sens., 14.