Author:
Suhr Alane, Zhou Stephanie, Zhang Ally, Zhang Iris, Bai Huajun, Artzi Yoav
Publisher
Association for Computational Linguistics
Cited by
88 articles.
1. A survey on knowledge-enhanced multimodal learning;Artificial Intelligence Review;2024-09-09
2. Probing Fundamental Visual Comprehend Capabilities on Vision Language Models via Visual Phrases from Structural Data;Cognitive Computation;2024-09-05
3. Vision-Language Models for Vision Tasks: A Survey;IEEE Transactions on Pattern Analysis and Machine Intelligence;2024-08
4. Fine-grained Textual Inversion Network for Zero-Shot Composed Image Retrieval;Proceedings of the 47th International ACM SIGIR Conference on Research and Development in Information Retrieval;2024-07-10
5. CaLa: Complementary Association Learning for Augmenting Composed Image Retrieval;Proceedings of the 47th International ACM SIGIR Conference on Research and Development in Information Retrieval;2024-07-10