Authors:
Huo Zhanqiang, Zhang Kunwei, Luo Fen, Qiao Yingxu
Publisher:
Springer Nature Singapore