ViTAEv2: Vision Transformer Advanced by Exploring Inductive Bias for Image Recognition and Beyond

Published: 2023-01-12 Issue: 5 Volume: 131 Page: 1141-1162
ISSN: 0920-5691
Container-title: International Journal of Computer Vision
Language: en
Short-container-title: Int J Comput Vis

Author:

Zhang Qiming, Xu Yufei, Zhang Jing, Tao Dacheng

Publisher

Springer Science and Business Media LLC

Subject

Artificial Intelligence, Computer Vision and Pattern Recognition, Software

Link

https://link.springer.com/content/pdf/10.1007/s11263-022-01739-w.pdf

References: 107 articles.

1. Adelson, E. H., Anderson, C. H., Bergen, J. R., Burt, P. J., & Ogden, J. M. (1984). Pyramid methods in image processing. RCA Engineer, 29(6), 33–41.

2. Ali, A., Touvron, H., Caron, M., Bojanowski, P., Douze, M., Joulin, A., Laptev, I., Neverova, N., Synnaeve, G., Verbeek, J., et al. (2021). XCiT: Cross-covariance image transformers. Advances in Neural Information Processing Systems, 34, 20014–20027.

3. Ba, J. L., Kiros, J. R., & Hinton, G. E. (2016). Layer normalization. arXiv preprint arXiv:1607.06450.

4. Bao, H., Dong, L., Piao, S., & Wei, F. (2021). BEiT: BERT pre-training of image transformers. In International Conference on Learning Representations.

5. Bay, H., Tuytelaars, T., & Van Gool, L. (2006). SURF: Speeded up robust features. In European Conference on Computer Vision (pp. 404–417). Springer.

Cited by 97 articles.

1. Towards robust neural networks: Exploring counterfactual causality-based repair;Expert Systems with Applications;2024-12

2. Object detection with a dynamic interactive network based on relational graph routing;Applied Soft Computing;2024-11

3. Vision transformer promotes cancer diagnosis: A comprehensive review;Expert Systems with Applications;2024-10

4. Fine-grained gaze estimation based on the combination of regression and classification losses;Applied Intelligence;2024-09-03

5. HIRI-ViT: Scaling Vision Transformer With High Resolution Inputs;IEEE Transactions on Pattern Analysis and Machine Intelligence;2024-09