Funder
Institute for Information Communication Technology Planning and Evaluation
Subject
Artificial Intelligence,Cognitive Neuroscience
Reference50 articles.
1. Anderson, P., He, X., Buehler, C., Teney, D., Johnson, M., & Gould, S., et al. (2018). Bottom-up and top-down attention for image captioning and visual question answering. In Proceedings of the IEEE conference on computer vision and pattern recognition (pp. 6077–6086).
2. Antol, S., Agrawal, A., Lu, J., Mitchell, M., Batra, D., & Lawrence Zitnick, C., et al. (2015). VQA: Visual question answering. In Proceedings of the IEEE international conference on computer vision (pp. 2425–2433).
3. Bahdanau, D., Cho, K., & Bengio, Y. (2015). Neural machine translation by jointly learning to align and translate. In International conference on learning representations (pp. 1–15).
4. Ben-Younes, H., Cadene, R., Cord, M., & Thome, N. (2017). Mutan: Multimodal tucker fusion for visual question answering. In Proceedings of the IEEE international conference on computer vision (pp. 2612–2620).
5. Cadene, R., Ben-Younes, H., Cord, M., & Thome, N. (2019). Murel: Multimodal relational reasoning for visual question answering. In Proceedings of the IEEE conference on computer vision and pattern recognition (pp. 1989–1998).
Cited by
18 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献