Affiliation:
1. Harbin Institute of Technology at Shenzhen, Shenzhen, China
Funder
NSFC Fund
Basic and Applied Basic Research Foundation of Guangdong Province
Guangdong Provincial Key Laboratory of Novel Security Intelligence Technologies
Shenzhen Key Technical Project
Shenzhen Fundamental Research and Discipline Layout project
Publisher
Institute of Electrical and Electronics Engineers (IEEE)
Subject
Electrical and Electronic Engineering,Media Technology
Reference64 articles.
1. A comprehensive survey on cross-modal retrieval;wang;arXiv 1607 06215,2016
2. Unicoder-VL: A Universal Encoder for Vision and Language by Cross-Modal Pre-Training
3. Cross-modal Retrieval with Correspondence Autoencoder
4. ViLT: Vision-and-language transformer without convolution or region supervision;kim;Proc IEEE Inter Conf Mach Learn (ICML),2021
5. Deep Supervised Cross-Modal Retrieval
Cited by
3 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献