Author:
Liu Jin,Wang GuoXiang,Fan ChongFeng,Zhou Fengyu,Xu HuiJuan
Subject
Artificial Intelligence,Information Systems and Management,Management Information Systems,Software
Reference69 articles.
1. VLDeformer: Vision–language decomposed transformer for fast cross-modal retrieval;Zhang;Knowl.-Based Syst.,2022
2. Graph convolutional networks in language and vision: A survey;Ren,2022
3. Y. Hong, Q. Wu, Y. Qi, C. Rodriguez-Opazo, S. Gould, Vln bert: A recurrent vision-and-language bert for navigation, in: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2021, pp. 1643–1653.
4. Inner knowledge-based Img2Doc scheme for visual question answering;Li;ACM Trans. Multimed. Comput. Commun. Appl. (TOMM),2022
5. C. Kervadec, G. Antipov, M. Baccouche, C. Wolf, Roses are red, violets are blue... but should vqa expect them to?, in: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2021, pp. 2776–2785.
Cited by
3 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献