Funder
National Key Research and Development Program of China
National Natural Science Foundation of China
Innovative Research Group of the National Natural Science Founda- tion of China
Publisher
Springer Science and Business Media LLC
Subject
Artificial Intelligence,Computer Vision and Pattern Recognition,Software
Reference52 articles.
1. Anderson, P., Wu, Q., Teney, D., et al. (2018). Vision-and-language navigation: Interpreting visually-grounded navigation instructions in real environments. In CVPR, pp 3674–3683.
2. Antol, S., Agrawal, A., Lu, J., et al. (2015). Vqa: Visual question answering. In ICCV, pp 2425–2433.
3. Ba, J. L., Kiros, J. R., & Hinton, G. E. (2016). Layer normalization. arXiv preprint arXiv:1607.06450
4. Caesar, H., Uijlings, J., & Ferrari, V. (2016). Region-based semantic segmentation with end-to-end training. In ECCV, pp 381–397.
5. Chen, Y., Li, L., Yu, L., et al. (2020). UNITER: universal image-text representation learning. In A. Vedaldi, H. Bischof, T. Brox et al (eds) ECCV, pp 104–120.
Cited by
3 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献