Author:
Ruwa Nelson,Mao Qirong,Song Heping,Jia Hongjie,Dong Ming
Funder
National Nature Science Foundation of China
National Natural Science Foundation of China
the Natural Science Foundation of Jiangsu Province
Subject
Computer Vision and Pattern Recognition,Signal Processing,Software
Reference65 articles.
1. Anderson, P., He, X., Buehler, C., Teney, D., Johnson, M., Gould, S., Zhang, L., 2017. Bottom-up and top-down attention for image captioning and VQA, arXiv preprint arXiv:1707.07998.
2. VQA: visual question answering;Antol,2015
3. Ben-Younes, H., Cadene, R., Cord, M., Thome, N., 2017. Mutan: Multimodal tucker fusion for visual question answering, in: Proc. IEEE Int. Conf. Comp. Vis, Vol. 3.
4. A convolutional neural network for modelling sentences;Blunsom,2014
5. Large-scale visual sentiment ontology and detectors using adjective noun pairs;Borth,2013
Cited by
6 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献