Affiliation:
1. College of Big Data Statistics, Guizhou University of Finance and Economics, Guiyang 550025, China
2. Key Laboratory of Advanced Manufacturing Technology, Ministry of Education, Guizhou University, Guiyang 550025, China
Abstract
UAV multitarget detection plays a pivotal role in civil and military fields. Although deep learning methods provide a more effective solution to this task, changes in target size, shape change, occlusion, and lighting conditions from the perspective of drones still bring great challenges to research in this field. Based on the above problems, this paper proposes an aerial image detection model with excellent performance and strong robustness. First, in view of the common problem that small targets in aerial images are prone to misdetection and missed detection, the idea of Bi-PAN-FPN is introduced to improve the neck part in YOLOv8-s. By fully considering and reusing multiscale features, a more advanced and complete feature fusion process is achieved while maintaining the parameter cost as much as possible. Second, the GhostblockV2 structure is used in the backbone of the benchmark model to replace part of the C2f module, which suppresses information loss during long-distance feature transmission while significantly reducing the number of model parameters; finally, WiseIoU loss is used as bounding box regression loss, combined with a dynamic nonmonotonic focusing mechanism, and the quality of anchor boxes is evaluated by using “outlier” so that the detector takes into account different quality anchor boxes to improve the overall performance of the detection task. The algorithm’s performance is compared and evaluated on the VisDrone2019 dataset, which is widely used worldwide, and a detailed ablation experiment, contrast experiment, interpretability experiment, and self-built dataset experiment are designed to verify the effectiveness and feasibility of the proposed model. The results show that the proposed aerial image detection model has achieved obvious results and advantages in various experiments, which provides a new idea for the deployment of deep learning in the field of UAV multitarget detection.
Funder
Youth Science and Technology Talent Growth Project of Guizhou Provincial Department of Education
Research Fund of Guizhou University of Finance and Economics
National Natural Science Foundation of China
Guizhou Provincial Science and Technology Plan Project
the Open Fund Project supported by the Key Laboratory of Advanced Manufacturing Technology Ministry of Education
the Guizhou Province Graduate Research Fund
Subject
Artificial Intelligence,Computer Science Applications,Aerospace Engineering,Information Systems,Control and Systems Engineering
Reference41 articles.
1. Adaimi, G., Kreiss, S., and Alahi, A. (2020). Perceiving Traffic from Aerial Images. arXiv.
2. Vehicle Detection from UAV Imagery with Deep Learning: A Review;Bouguettaya;IEEE Trans. Neural Netw. Learn. Syst.,2021
3. Byun, S., Shin, I.-K., Moon, J., Kang, J., and Choi, S.-I. (2021). Road Traffic Monitoring from UAV Images Using Deep Learning Networks. Remote Sens., 13.
4. Chang, Y.-C., Chen, H.-T., Chuang, J.-H., and Liao, I.-C. (2018, January 7–10). Pedestrian Detection in Aerial Images Using Vanishing Point Transformation and Deep Learning. Proceedings of the 2018 25th IEEE International Conference on Image Processing (ICIP), Athens, Greece.
5. Deep Learning Approach in Aerial Imagery for Supporting Land Search and Rescue Missions;Gotovac;Int. J. Comput. Vis.,2019
Cited by
107 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献