Funder
Key Technology Research and Development Program of Shandong Province
Program for Changjiang Scholars and Innovative Research Team in University
Natural Science Basic Research Program of Shaanxi Province
Foundation for Innovative Research Groups of the National Natural Science Foundation of China
Ministry of Education of the People's Republic of China
Higher Education Discipline Innovation Project
National Natural Science Foundation of China