Affiliation:
1. College of Field Engineering, Army Engineering University of PLA, Nanjing 210007, China
Abstract
Objects in UAV aerial images vary widely in scale, are densely distributed, and are often occluded, which poses considerable challenges for object detection. To address this problem, this paper proposes a real-time multi-scale object detection method for UAV aerial images based on an improved YOLOv7 model (ATS-YOLOv7). First, the paper introduces a feature pyramid network, AF-FPN, composed of an adaptive attention module (AAM) and a feature enhancement module (FEM). Through the AAM and FEM, AF-FPN reduces the loss of deep feature information caused by the reduction of feature channels during convolution, strengthens feature perception, and improves detection speed and accuracy for multi-scale objects. Second, a prediction head based on a transformer encoder block is added to the three-head structure of YOLOv7, improving the model's ability to capture global information and its feature expression, and thus enabling efficient detection of tiny and densely occluded objects. Third, because CIoU (complete intersection over union), the localization loss function of YOLOv7, cannot facilitate regression of the prediction box angle toward the ground truth box, which results in slow convergence during model training, the paper proposes a loss function with angle regression, SIoU (soft intersection over union), to accelerate convergence. Finally, a series of comparative experiments is carried out on the DIOR dataset. The results indicate that ATS-YOLOv7 achieves the best detection accuracy among the compared methods (mAP of 87%) and meets the real-time requirements of image processing (detection speed of 94.2 FPS).
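For readers unfamiliar with the CIoU loss named in the abstract, the following is a minimal sketch of its commonly published form for axis-aligned boxes: 1 - IoU, plus a normalized center-distance penalty, plus an aspect-ratio consistency term. It is not the authors' code and does not reproduce the SIoU angle term, which the abstract does not define. The function names `iou` and `ciou_loss` and the `(x1, y1, x2, y2)` box layout are assumptions made for illustration.

```python
import math

def iou(a, b):
    """Intersection over union of two boxes given as (x1, y1, x2, y2)."""
    iw = max(0.0, min(a[2], b[2]) - max(a[0], b[0]))
    ih = max(0.0, min(a[3], b[3]) - max(a[1], b[1]))
    inter = iw * ih
    area_a = (a[2] - a[0]) * (a[3] - a[1])
    area_b = (b[2] - b[0]) * (b[3] - b[1])
    union = area_a + area_b - inter
    return inter / union if union > 0 else 0.0

def ciou_loss(pred, gt):
    """CIoU loss sketch: 1 - IoU + center-distance term + aspect-ratio term.

    Assumes both boxes have positive width and height.
    """
    i = iou(pred, gt)

    # Squared distance between the two box centers.
    pcx, pcy = (pred[0] + pred[2]) / 2, (pred[1] + pred[3]) / 2
    gcx, gcy = (gt[0] + gt[2]) / 2, (gt[1] + gt[3]) / 2
    rho2 = (pcx - gcx) ** 2 + (pcy - gcy) ** 2

    # Squared diagonal of the smallest box enclosing both boxes.
    cw = max(pred[2], gt[2]) - min(pred[0], gt[0])
    ch = max(pred[3], gt[3]) - min(pred[1], gt[1])
    c2 = cw ** 2 + ch ** 2

    # Aspect-ratio consistency term v and its trade-off weight alpha.
    pw, ph = pred[2] - pred[0], pred[3] - pred[1]
    gw, gh = gt[2] - gt[0], gt[3] - gt[1]
    v = (4 / math.pi ** 2) * (math.atan(gw / gh) - math.atan(pw / ph)) ** 2
    denom = (1 - i) + v
    alpha = v / denom if denom > 0 else 0.0

    distance_term = rho2 / c2 if c2 > 0 else 0.0
    return 1 - i + distance_term + alpha * v
```

Calling `ciou_loss(box, box)` on two identical boxes gives a loss of zero, since the IoU is 1 and both penalty terms vanish. None of the three terms depends on the direction of the offset between box centers, which is the limitation the abstract attributes to CIoU.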
Funder
National Natural Science Foundation of China
Subject
Electrical and Electronic Engineering, Computer Networks and Communications, Hardware and Architecture, Signal Processing, Control and Systems Engineering