1. End-to-end object detection with transformers;carion;Proceedings of the European Conference on Computer Vision (ECCV),2020
2. Bleu: a method for automatic evaluation of machine translation;papineni;Proceedings of the 40th Annual Meeting on Association for Computational Linguistics - ACL '02,2002
3. Tokens-to-Token ViT: Training Vision Transformers from Scratch on ImageNet
4. Regularizing RNNs for Caption Generation by Reconstructing the Past with the Present
5. Swin Transformer: Hierarchical Vision Transformer using Shifted Windows