1. J. Devlin, M.-W. Chang, K. Lee, K. Toutanova, BERT: Pre-training of deep bidirectional transformers for language understanding, in: Proceedings of the Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Vol. 1 (Long and Short Papers), Association for Computational Linguistics, Minneapolis, Minnesota, 2019, pp. 4171–4186.
2. T. Brown, B. Mann, N. Ryder, M. Subbiah, J. Kaplan, P. Dhariwal, A. Neelakantan, P. Shyam, G. Sastry, A. Askell, et al., Language models are few-shot learners, in: Advances in Neural Information Processing Systems (NeurIPS), Vol. 33, 2020, pp. 1877–1901.
3. L. Ouyang, J. Wu, X. Jiang, D. Almeida, C. Wainwright, P. Mishkin, C. Zhang, S. Agarwal, K. Slama, A. Ray, et al., Training language models to follow instructions with human feedback, in: Advances in Neural Information Processing Systems (NeurIPS), Vol. 35, 2022, pp. 27730–27744.
4. A. Radford, J.W. Kim, C. Hallacy, A. Ramesh, G. Goh, S. Agarwal, G. Sastry, A. Askell, P. Mishkin, J. Clark, et al., Learning transferable visual models from natural language supervision, in: International Conference on Machine Learning (ICML), 2021, pp. 8748–8763.
5. C. Jia, Y. Yang, Y. Xia, Y.-T. Chen, Z. Parekh, H. Pham, Q. Le, Y.-H. Sung, Z. Li, T. Duerig, Scaling up visual and vision-language representation learning with noisy text supervision, in: International Conference on Machine Learning (ICML), 2021, pp. 4904–4916.