1. A. Radford, J.W. Kim, C. Hallacy, A. Ramesh, G. Goh, S. Agarwal, G. Sastry, A. Askell, P. Mishkin, J. Clark, et al., Learning transferable visual models from natural language supervision, in: International Conference on Machine Learning, 2021, pp. 8748–8763.
2. F-SCP: An automatic prompt generation method for specific classes based on visual language pre-training models;Han;Pattern Recognit.,2024
3. M.U. Khattak, S.T. Wasim, M. Naseer, F.S. Khan, Self-regulating prompts: Foundational model adaptation without forgetting, in: International Conference on Computer Vision, 2023, pp. 15190–15200.
4. Ta-adapter: Enhancing few-shot CLIP with task-aware encoders;Zhang;Pattern Recognit.,2024
5. Learning to prompt for vision-language models;Zhou;Int. J. Comput. Vis.,2022