CLIP-Adapter: Better Vision-Language Models with Feature Adapters

Published: 2023-09-15 Volume: 132 Issue: 2 Pages: 581-595
ISSN: 0920-5691
Container-title: International Journal of Computer Vision
Language: en
Short-container-title: Int J Comput Vis

Author:

Gao Peng, Geng Shijie, Zhang Renrui, Ma Teli, Fang Rongyao, Zhang Yongfeng, Li Hongsheng, Qiao Yu

Publisher

Springer Science and Business Media LLC

Subject

Artificial Intelligence, Computer Vision and Pattern Recognition, Software

Link

https://link.springer.com/content/pdf/10.1007/s11263-023-01891-x.pdf

References: 70 articles.

1. Alayrac, J. B., Donahue, J., Luc, P., et al. (2022). Flamingo: a visual language model for few-shot learning. In A. H. Oh, A. Agarwal, D. Belgrave, et al. (Eds.), Advances in neural information processing systems. MIT Press.

2. Anderson, P., He, X., & Buehler, C., et al. (2018). Bottom-up and top-down attention for image captioning and visual question answering. In CVPR.

3. Bossard, L., Guillaumin, M., & Van Gool, L. (2014). Food-101 – Mining discriminative components with random forests. In European conference on computer vision, Springer, pp. 446–461.

4. Brown, T., Mann, B., & Ryder, N., et al. (2020). Language models are few-shot learners. In NeurIPS.

5. Carion, N., Massa, F., & Synnaeve, G., et al. (2020). End-to-end object detection with transformers. In ECCV.

Cited by 103 articles.

1. Cluster prototype earth mover’s distance adapters and alignment-guided prompt learning for vision–language models; Pattern Recognition; 2024-12

2. Embedded prompt tuning: Towards enhanced calibration of pretrained models for medical images; Medical Image Analysis; 2024-10

3. Self-supervised visual–textual prompt learning for few-shot grading of gastric intestinal metaplasia; Knowledge-Based Systems; 2024-10

4. Facilitating self-directed language learning in real-life scene description tasks with automated evaluation; Computers & Education; 2024-10

5. WeakCLIP: Adapting CLIP for Weakly-Supervised Semantic Segmentation; International Journal of Computer Vision; 2024-09-05