Vision-Language Pre-Training with Triple Contrastive Learning-Reference-Cited by-同舟云学术

Vision-Language Pre-Training with Triple Contrastive Learning

Published:2022-06 Issue: Volume: Page:
ISSN:
Container-title:2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
language:
Short-container-title:

Author:

Yang Jinyu¹,Duan Jiali²,Tran Son²,Xu Yi²,Chanda Sampath²,Chen Liqun²,Zeng Belinda²,Chilimbi Trishul²,Huang Junzhou¹

Affiliation:

1. University Of Texas at Arlington

2. Amazon

Funder

Cancer Prevention and Research Institute of Texas (CPRIT)

Publisher

IEEE

Link

Reference49 articles.

1. Learning transferable visual models from natural language supervision;radford;Arxiv preprint arXiv,2021

2. Imagebert: Cross-modal pre-training with large-scale weak-supervised image-text data;qi;Arxiv preprint arXiv,2020

3. Repre-sentation learning with contrastive predictive coding;van den oord;ar Xiv preprint arXiv,2018

4. Unsupervised learning of visual representations by solving jigsaw puzzles;noroozi;European Conference on Computer Vision (ECCV),0

5. Decoupled weight decay regularization;loshchilov;Arxiv preprint arXiv,2017

Cited by 116 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

3. Detecting and Grounding Multi-Modal Media Manipulation and Beyond;IEEE Transactions on Pattern Analysis and Machine Intelligence;2024-08