Smallcap: Lightweight Image Captioning Prompted with Retrieval Augmentation-Reference-Cited by-同舟云学术

Smallcap: Lightweight Image Captioning Prompted with Retrieval Augmentation

Published:2023-06 Issue: Volume: Page:
ISSN:
Container-title:2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
language:
Short-container-title:

Author:

Ramos Rita¹,Martins Bruno¹,Elliott Desmond²,Kementchedjhieva Yova²

Affiliation:

1. INESC-ID, Instituto Superior Técnico, University of Lisbon

2. University of Copenhagen,Department of Computer Science

Publisher

IEEE

Link

http://xplorestaging.ieee.org/ielx7/10203037/10203050/10203784.pdf?arnumber=10203784

Reference49 articles.

1. A good prompt is worth millions of pa-rameters? low-resource prompt-based learning for vision-language models;jin;ArXiv Preprint,2021

2. Few-shot learning with retrieval augmented language models;izacard;ArXiv Preprint,2022

3. Deep visual-semantic align-ments for generating image descriptions;karpathy;Proceedings of the IEEE Conference on Computer Vision and Pattern Recog-nition,0

4. Billion-scale similarity search with gpus;johnson;ArXiv Preprint,2017

5. Scaling Up Vision-Language Pretraining for Image Captioning

Cited by 20 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. A surrogate-assisted extended generative adversarial network for parameter optimization in free-form metasurface design;Neural Networks;2024-12

2. GRPIC: an end-to-end image captioning model using three visual features;International Journal of Machine Learning and Cybernetics;2024-09-04

3. MyUEVision: an application generating image caption for assisting visually impaired people;Journal of Enabling Technologies;2024-09-03

4. Military Image Captioning for Low-Altitude UAV or UGV Perspectives;Drones;2024-08-24

5. A Survey on RAG Meeting LLMs: Towards Retrieval-Augmented Large Language Models;Proceedings of the 30th ACM SIGKDD Conference on Knowledge Discovery and Data Mining;2024-08-24