Funder
National Natural Science Foundation of China
Cited by
1 article.