1. An image is worth 16×16 words: Transformers for image recognition at scale;dosovitskiy;International Conference on Learning Representations,2021
2. Panda: Prompt transfer meets knowledge distillation for efficient model adaptation;zhong;ArXiv Preprint,2022
3. Analyzing Redundancy in Pretrained Transformer Models
4. Point Transformer
5. Head2toe: Utilizing intermediate rep-resentations for better transfer learning;evci;International Conference on Machine Learning,2022