1. Visual prompting: Modifying pixel space to adapt pre-trained models;Bahng,2022
2. BEiT: BERT pre-training of image transformers;Bao,2021
3. Simple, scalable adaptation for neural machine translation;Bapna,2019
4. BitFit: Simple parameter-efficient fine-tuning for transformer-based masked language-models;Ben Zaken,2021
5. Is space–time attention all you need for video understanding?;Bertasius,2021