1. Alexey Dosovitskiy and Lucas Beyer and Alexander Kolesnikov and Dirk Weissenborn and Xiaohua Zhai and Thomas Unterthiner and Mostafa Dehghani and Matthias Minderer and Georg Heigold and Sylvain Gelly and Jakob Uszkoreit and Neil Houlsby (2021) An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale. https://openreview.net/forum?id=YicbFdNTTy, International Conference on Learning Representations
2. Shannon, C.E. and Weaver, W. (1949) The {Mathematical} {Theory} of {Communication}. University of Illinois Press, Urbana
3. Goodfellow, Ian (2016) Nips 2016 tutorial: Generative adversarial networks. arXiv preprint arXiv:1701.00160
4. Strudel, Robin and Garcia, Ricardo and Laptev, Ivan and Schmid, Cordelia (2021) Segmenter: Transformer for Semantic Segmentation. 10.1109/ICCV48922.2021.00717, 7242-7252, , , 2021 IEEE/CVF International Conference on Computer Vision (ICCV)
5. Yuxin Wu and Alexander Kirillov and Francisco Massa and Wan-Yen Lo and Ross Girshick. Detectron2. 2019, https://github.com/facebookresearch/detectron2