1. P. Hirugade, N. Suryavanshi, R. Bhagwat, S. Rajput, and R. Phadke, “A survey on optical character recognition for handwritten devanagari script using deep learning,” Available at SSRN 4031738 (2022).
2. A. Dosovitskiy, L. Beyer, A. Kolesnikov, D. Weissenborn, X. Zhai, T. Unterthiner, M. Dehghani, M. Minderer, G. Heigold, S. Gelly, et al., “An image is worth 16x16 words: Transformers for image recognition at scale,” arXiv preprint arXiv:2010.11929 (2020).
3. A. Vaswani, N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A. N. Gomez, Ł. Kaiser, and I. Polosukhin, “Attention is all you need,” Advances in neural information processing systems 30 (2017).
4. A survey of transfer learning
5. R. Wightman, “Pytorch image models,” https://github.com/rwightman/pytorch-image-models (2019).