Author:
Zhang Renrui,Zhang Wei,Fang Rongyao,Gao Peng,Li Kunchang,Dai Jifeng,Qiao Yu,Li Hongsheng
Publisher
Springer Nature Switzerland
Reference73 articles.
1. Anderson, P., et al.: Bottom-up and top-down attention for image captioning and visual question answering. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 6077–6086 (2018)
2. Antol, S., et al.: VQA: visual question answering. In: Proceedings of the IEEE International Conference on Computer Vision, pp. 2425–2433 (2015)
3. Lecture Notes in Computer Science;L Bossard,2014
4. Brown, T.B., et al.: Language models are few-shot learners. arXiv preprint arXiv:2005.14165 (2020)
5. Lecture Notes in Computer Science;N Carion,2020
Cited by
20 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献