1. The Cityscapes Dataset for Semantic Urban Scene Understanding
2. Not all images are worth 16×16 words: Dynamic transformers for efficient image recognition;wang;Advances in Neural Information Processing Systems 34 Annual Conference on Neural Information Processing Systems 2021 NeurIPS 2021,2021
3. On the Efficacy of Knowledge Distillation
4. Going deeper with Image Transformers
5. An image is worth 16×16 words: Transformers for image recognition at scale;dosovitskiy;International Conference on Learning Representations,0