Exploring the complementarity between convolution and transformer matching for visual tracking
-
Published:2024-09
Issue:
Volume:300
Page:112184
-
ISSN:0950-7051
-
Container-title:Knowledge-Based Systems
-
language:en
-
Short-container-title:Knowledge-Based Systems
Author:
Wang Zheng’aoORCID, Li Ming, Pei Wenjie, Lu GuangmingORCID, Chen FanglinORCID
Reference73 articles.
1. L. Bertinetto, J. Valmadre, J.F. Henriques, A. Vedaldi, P.H.S. Torr, Fully-convolutional Siamese networks for object tracking, in: European Conference on Computer Vision, 2016, pp. 850–865. 2. B. Li, J. Yan, W. Wu, Z. Zhu, X. Hu, High performance visual tracking with Siamese region proposal network, in: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2018, pp. 8971–8980. 3. An image is worth 16x16 words: Transformers for image recognition at scale;Dosovitskiy,2020 4. B. Li, W. Wu, Q. Wang, F. Zhang, J. Xing, J. Yan, Siamrpn++: Evolution of Siamese visual tracking with very deep networks, in: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2019, pp. 4282–4291. 5. D. Guo, J. Wang, Y. Cui, Z. Wang, S. Chen, SiamCAR: Siamese fully convolutional classification and regression for visual tracking, in: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020, pp. 6269–6277.
|
|