1. Fosnet: An end-to-end trainable deep neural network for scene recognition;seong;IEEE Access,2020
2. An image is worth 16x16 words: Transformers for image recognition at scale;dosovitskiy;ArXiv Preprint,2020
3. Training data-efficient image transformers & distillation through attention;touvron;International Conference on Machine Learning,0
4. End-to-end object detection with transformers;carion;European Conference on Computer Vision,2020
5. Deformable detr: Deformable transformers for end-to-end object detection;zhu;ArXiv Preprint,2020