Pic2Word: Mapping Pictures to Words for Zero-shot Composed Image Retrieval-Reference-Cited by-同舟云学术

Pic2Word: Mapping Pictures to Words for Zero-shot Composed Image Retrieval

Published:2023-06 Issue: Volume: Page:
ISSN:
Container-title:2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
language:
Short-container-title:

Author:

Saito Kuniaki¹,Sohn Kihyuk²,Zhang Xiang³,Li Chun-Liang³,Lee Chen-Yu³,Saenko Kate¹,Pfister Tomas³

Affiliation:

1. Boston University

2. Google Research

3. Google Cloud AI Research

Publisher

IEEE

Link

http://xplorestaging.ieee.org/ielx7/10203037/10203050/10203840.pdf?arnumber=10203840

Reference39 articles.

1. Fashionvlp: Vision language transformer for fashion re-trieval with feedback;goenka;CVPR,2022

2. Composing Text and Image for Image Retrieval - an Empirical Odyssey

3. An image is worth one word: Personalizing text-to-image generation using textual inversion;gal;ArXiv Preprint,2022

4. CLIP Models are Few-Shot Learners: Empirical Studies on VQA and Visual Entailment

5. The many faces of robust-ness: A critical analysis of out-of-distribution generalization;hendrycks;Proceedings of the IEEEICVF International Conference on Computer Vision,0

Cited by 16 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. From text to mask: Localizing entities using the attention of text-to-image diffusion models;Neurocomputing;2024-12

2. An efficient zero-labeling segmentation approach for pest monitoring on smartphone-based images;European Journal of Agronomy;2024-10

3. Backward induction-based deep image search;PLOS ONE;2024-09-09

4. Fine-grained Textual Inversion Network for Zero-Shot Composed Image Retrieval;Proceedings of the 47th International ACM SIGIR Conference on Research and Development in Information Retrieval;2024-07-10

5. LDRE: LLM-based Divergent Reasoning and Ensemble for Zero-Shot Composed Image Retrieval;Proceedings of the 47th International ACM SIGIR Conference on Research and Development in Information Retrieval;2024-07-10