Affiliation:
1. National University of Singapore,Show Lab
2. Sea AI Lab
Funder
National Research Foundation, Singapore under its NRFF
Reference48 articles.
1. A corpus for reasoning about natural language grounded in photographs;suhr;ArXiv Preprint,2018
2. Seeing Out of tHe bOx: End-to-End Pre-training for Vision-Language Representation Learning
3. Conceptual Captions: A Cleaned, Hypernymed, Image Alt-text Dataset For Automatic Image Captioning
4. Scaling upvision-language pre-training for image captioning;hu;Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition,0
5. Git: A generative image-to-text transformer for vision and language;wang;ArXiv Preprint,2022
Cited by
8 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献
1. Cross-modality interaction reasoning for enhancing vision-language pre-training in image-text retrieval;Applied Intelligence;2024-09-11
2. Prompt-Based Memory Bank for Continual Test-Time Domain Adaptation in Vision-Language Models;2024 International Joint Conference on Neural Networks (IJCNN);2024-06-30
3. Resource-efficient Text-based Person Re-identification on Embedded Devices;2024 20th International Conference on Distributed Computing in Smart Systems and the Internet of Things (DCOSS-IoT);2024-04-29
4. Learning Fine-Grained Information Alignment for Calibrated Cross-Modal Retrieval;ICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP);2024-04-14
5. Dual-Color Granularity Alignment for Text-Based Person Search;ICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP);2024-04-14