1. 3d object representations for fine-grained categorization, in;Krause,2013
2. C. Wah, S. Branson, P. Welinder, et al., The caltech-ucsd birds-200-2011 dataset, (2011) 10.
3. S. Maji, E. Rahtu, J. Kannala, et al., Fine-grained visual classification of aircraft, 2013, arXiv preprint arXiv:1306.5151.
4. A. Dosovitskiy, L. Beyer, A. Kolesnikov, et al., An image is worth 16x16 words: Transformers for image recognition at scale, 2020, arXiv preprint arXiv:2010.11929.
5. Swin transformer: hierarchical vision transformer using shifted windows;Liu,2021