1. Exploring visual prompts for adapting large-scale models;Bahng,2022
2. Language-aware soft prompting for vision & language foundation models;Bulat;CoRR,2022
3. VLP: A Survey on Vision-language Pre-training
4. Prompt learning with optimal transport for vision-language models;Chen;CoRR,2022
5. Microsoft coco captions: Data collection and evaluation server;Chen,2015