Funder
Japan Society for the Promotion of Science
Hosei University