1. Learning transferable visual models from natural language supervision;Radford,2021
2. PaLM-E: An embodied multimodal language model;Driess,2023
3. Language models are few-shot learners;Brown,2020
4. LLaMA: Open and efficient foundation language models;Touvron,2023
5. Parameter-efficient transfer learning for NLP;Houlsby,2019