Global Multi-Scale Optimization and Prediction Head Attentional Siamese Network for Aerial Tracking
Author:
Chen Qiqi (1, 2), Liu Jinghong (1), Wang Xuan (1), Zuo Yujia (1), Liu Chenglong (1)
Affiliation:
1. Changchun Institute of Optics, Fine Mechanics and Physics, Chinese Academy of Sciences, Changchun 130033, China
2. University of Chinese Academy of Sciences, Beijing 100049, China
Abstract
Siamese-based trackers are widely used in object tracking. Aerial tracking, however, faces challenges such as scale variation, viewpoint change, background clutter and occlusion, and most existing Siamese trackers rely on single-scale, local features, which makes accurate aerial tracking difficult. To address this, we propose a global multi-scale optimization and prediction head attentional Siamese network. First, a transformer-based multi-scale and global feature encoder (TMGFE) performs global multi-scale optimization of the features. Second, a prediction head attentional module (PHAM) adds context information to the prediction head by adaptively adjusting the spatial position and channel contributions of the response map. With these two components, the proposed tracker mitigates the challenges above and improves tracking performance. We conduct ablation experiments on four aerial tracking benchmarks, UAV123, UAV20L, UAV123@10fps and DTB70, to verify the effectiveness of the proposed network. We also compare our tracker with several state-of-the-art (SOTA) trackers on four benchmarks to verify its superior performance. The tracker runs at 40.8 fps on an RTX 3060 Ti GPU.
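To make the idea of reweighting a response map by channel and by spatial position concrete, the following is a minimal, hypothetical sketch in plain Python. It is not the authors' PHAM: the abstract does not specify the module's layers, so the learned parameters are replaced here by fixed average pooling followed by a sigmoid, and the function names (`channel_weights`, `spatial_weights`, `reweight`) are illustrative assumptions only.

```python
import math


def sigmoid(x):
    """Logistic function mapping any real number into (0, 1)."""
    return 1.0 / (1.0 + math.exp(-x))


def channel_weights(resp):
    """One weight per channel: sigmoid of that channel's global average.

    Assumption: a trained module would use learned layers here.
    """
    return [
        sigmoid(sum(sum(row) for row in ch) / (len(ch) * len(ch[0])))
        for ch in resp
    ]


def spatial_weights(resp):
    """One weight per (row, col) position: sigmoid of the mean across channels."""
    c, h, w = len(resp), len(resp[0]), len(resp[0][0])
    return [
        [sigmoid(sum(resp[k][i][j] for k in range(c)) / c) for j in range(w)]
        for i in range(h)
    ]


def reweight(resp):
    """Scale each response value by its channel weight and its spatial weight.

    resp is a nested list of shape [channels][height][width]; the output
    has the same shape.
    """
    cw = channel_weights(resp)
    sw = spatial_weights(resp)
    c, h, w = len(resp), len(resp[0]), len(resp[0][0])
    return [
        [[resp[k][i][j] * cw[k] * sw[i][j] for j in range(w)] for i in range(h)]
        for k in range(c)
    ]


# Toy response map: 2 channels, 2x2 spatial grid, one strong peak.
toy = [
    [[0.0, 0.0], [0.0, 0.0]],
    [[2.0, 0.0], [0.0, 0.0]],
]
out = reweight(toy)
```

In this toy case the channel containing the peak receives a larger weight than the empty channel, and the peak position receives a larger spatial weight than its neighbours. That is the qualitative behaviour the abstract attributes to adjusting "spatial position and channel contribution".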
Funder
National Natural Science Foundation of China
Subject
Physics and Astronomy (miscellaneous), General Mathematics, Chemistry (miscellaneous), Computer Science (miscellaneous)
Cited by
2 articles.