Funder
National Key R&D Program of China
Joint Advanced Research Foundation of China Electronics Technology Group Corporation
Publisher
Springer Science and Business Media LLC
Subject
Computer Networks and Communications,Hardware and Architecture,Media Technology,Software
Reference110 articles.
1. Xu K, Ba JL, Kiros R, et al (2015) Show, attend and tell: Neural image caption generation with visual attention. 32nd International Conference on Machine Learning, ICML 2015 3:2048–2057
2. Mandal D, Biswas S (2017) Query specific re-ranking for improved cross-modal retrieval. Pattern Recognit Lett 98:110–116. https://doi.org/10.1016/j.patrec.2017.09.008
3. Agrawal A, Lu J, Antol S et al (2017) VQA: Visual question answering. Int J Comput Vision 123:4–31. https://doi.org/10.1007/s11263-016-0966-6
4. Yu Z, Yu J, Cui Y et al (2019) Deep Modular Co-attention networks for visual question answering. In: 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR). IEEE, pp 6274–6283
5. Malinowski M, Rohrbach M, Fritz M (2015) Ask Your Neurons: A Neural-Based Approach to Answering Questions about Images. In: 2015 IEEE International Conference on Computer Vision (ICCV). IEEE, pp 1–9
Cited by
2 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献