Affiliations:
1. The State Key Laboratory of Information Engineering in Surveying, Mapping and Remote Sensing, Wuhan University, Wuhan 430079, China
2. School of Remote Sensing and Information Engineering, Wuhan University, Wuhan 430079, China
Abstract
Object detection plays a crucial role in unmanned aerial vehicle (UAV) missions, where captured objects are often small and require high-resolution processing. However, this requirement often conflicts with limited computing resources, vast fields of view, and low-latency requirements. To tackle these issues, we propose GA-Net, a novel approach tailored for UAV images. Its key innovation is the Grid Activation Module (GAM), which efficiently calculates grid activations, i.e., the probability of foreground presence at grid scale. With grid activations, the GAM filters out patches without objects, minimizes redundant computation, and improves inference speed. Additionally, Grid-based Dynamic Sample Selection (GDSS) focuses the model on discriminating positive samples and hard negatives, addressing background bias during training. A further enhancement is GhostFPN, which refines the Feature Pyramid Network (FPN) using the Ghost module and depth-wise separable convolution. This not only expands the receptive field for improved accuracy but also reduces computational complexity. We conducted comprehensive evaluations on DGTA-Cattle-v2, a synthetic dataset with added background images, and three public datasets (VisDrone, SeaDronesSee, DOTA) from diverse domains. The results demonstrate the effectiveness and practical applicability of GA-Net. Despite the common trade-off between accuracy and speed, GA-Net improves both through the strategic use of grid activations.
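The patch-filtering idea in the abstract can be illustrated with a minimal sketch. It is not the GA-Net implementation: the function names, the 0.5 threshold, and the 256-pixel patch size are all assumptions chosen for illustration. It only shows how a grid of foreground probabilities could be used to decide which image patches are passed on to a detector and which are skipped.

```python
from typing import List, Tuple

Cell = Tuple[int, int]            # (row, col) index of a grid cell
Box = Tuple[int, int, int, int]   # (x_min, y_min, x_max, y_max) in pixels


def select_active_patches(activations: List[List[float]],
                          threshold: float = 0.5) -> List[Cell]:
    """Return the grid cells whose foreground probability reaches the threshold.

    The threshold is a hypothetical value, not one taken from the paper.
    """
    return [
        (r, c)
        for r, row in enumerate(activations)
        for c, prob in enumerate(row)
        if prob >= threshold
    ]


def patch_box(cell: Cell, patch_size: int) -> Box:
    """Map a grid cell to the pixel box of the image patch it covers."""
    r, c = cell
    return (c * patch_size, r * patch_size,
            (c + 1) * patch_size, (r + 1) * patch_size)


# Toy 2x3 grid of activations (made-up numbers).
activations = [
    [0.02, 0.91, 0.10],
    [0.05, 0.40, 0.77],
]

kept = select_active_patches(activations, threshold=0.5)
boxes = [patch_box(cell, patch_size=256) for cell in kept]

total_cells = sum(len(row) for row in activations)
skipped_fraction = 1 - len(kept) / total_cells

print(kept)              # cells that would be forwarded to the detector
print(boxes)             # their pixel extents
print(skipped_fraction)  # share of patches that need no further computation
```

In this toy grid only two of the six cells pass the threshold, so the remaining patches would never reach the more expensive detection stage; this is the source of the speed-up the abstract attributes to grid activations.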
Funder
National Natural Science Foundation of China