Affiliation:
1. Changchun Institute of Optics, Fine Mechanics and Physics, Chinese Academy of Sciences, Changchun 130033, China
2. University of Chinese Academy of Sciences, Beijing 100049, China
Abstract
Benefiting from the powerful feature extraction capability of deep learning, Siamese trackers stand out for their advanced tracking performance. However, aerial tracking is constrained by challenges such as low resolution, occlusion, similar objects, small objects, scale variation, aspect ratio change and deformation, as well as by limited computational resources, so efficient and accurate aerial tracking is still difficult to realize. In this work, we design a lightweight and efficient adaptive temporal contextual aggregation Siamese network for aerial tracking, which uses a parallel atrous module (PAM) and an adaptive temporal context aggregation model (ATCAM) to mitigate the above problems. Firstly, by applying a series of atrous convolutions with different dilation rates in parallel, the PAM simultaneously extracts and aggregates multi-scale features with spatial contextual information on the same feature map, which effectively improves the ability to cope with changes in target appearance caused by challenges such as aspect ratio change, occlusion and scale variation. Secondly, the ATCAM adaptively introduces temporal contextual information to the target frame through an encoder-decoder structure, which helps the tracker resist interference and recognize the target when high-resolution features are difficult to extract, for example under low resolution or in the presence of similar objects. Finally, experiments on the UAV20L, UAV123@10fps and DTB70 benchmarks demonstrate the impressive performance of the proposed network, which runs at over 75.5 fps on an NVIDIA 3060Ti.
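The idea of running atrous convolutions with different dilation rates in parallel on the same input can be illustrated with a minimal one-dimensional sketch. This is not the authors' PAM: the function names, the fixed kernel, the dilation rates (1, 2, 3) and the aggregation by averaging are all assumptions made for illustration, and the actual module would operate on 2D feature maps with learned weights.

```python
def atrous_conv1d(signal, kernel, dilation):
    """Same-size 1D dilated (atrous) cross-correlation with zero padding.

    Hypothetical helper for illustration only.
    """
    center = len(kernel) // 2
    out = []
    for i in range(len(signal)):
        acc = 0.0
        for j, w in enumerate(kernel):
            # Kernel taps are spaced `dilation` samples apart.
            idx = i + (j - center) * dilation
            if 0 <= idx < len(signal):
                acc += w * signal[idx]
        out.append(acc)
    return out


def parallel_atrous(signal, kernel, dilations=(1, 2, 3)):
    """Run one branch per dilation rate on the same input, then average.

    Averaging is an assumed aggregation; other choices (sum,
    concatenation followed by a 1x1 convolution) are equally plausible.
    """
    branches = [atrous_conv1d(signal, kernel, d) for d in dilations]
    return [sum(vals) / len(branches) for vals in zip(*branches)]


def receptive_field(kernel_size, dilation):
    """Span of input samples covered by one dilated kernel."""
    return dilation * (kernel_size - 1) + 1


if __name__ == "__main__":
    impulse = [0, 0, 0, 1, 0, 0, 0]
    print(parallel_atrous(impulse, [1, 1, 1]))
    print([receptive_field(3, d) for d in (1, 2, 3)])
```

With a 3-tap kernel, the three branches cover 3, 5 and 7 input samples respectively, so the aggregated output mixes context at several scales without changing the size of the output.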
Funder
Natural Science Foundation of Jilin Province