Authors:
Li Xiujun, Yin Xi, Li Chunyuan, Zhang Pengchuan, Hu Xiaowei, Zhang Lei, Wang Lijuan, Hu Houdong, Dong Li, Wei Furu, Choi Yejin, Gao Jianfeng
Publisher
Springer International Publishing
References
41 articles.
1. Agrawal, H., et al.: Nocaps: novel object captioning at scale. In: ICCV (2019)
2. Anderson, P., et al.: Bottom-up and top-down attention for image captioning and visual question answering. In: CVPR (2018)
3. Brown, P.F., Lai, J.C., Mercer, R.L.: Aligning sentences in parallel corpora. In: Proceedings of the 29th Annual Meeting on Association for Computational Linguistics (1991)
4. Chen, W., Gan, Z., Li, L., Cheng, Y., Wang, W., Liu, J.: Meta module network for compositional visual reasoning (2019). arXiv preprint arXiv:1910.03230
5. Chen, Y.C., et al.: Uniter: learning universal image-text representations (2019). arXiv preprint arXiv:1909.11740
Cited by
493 articles.