1. Show and tell: A neural image caption generator.;Vinyals,2015
2. Image captioning with semantic attention.;You,2016
3. Deep visual-semantic alignments for generating image descriptions.;Karpathy,2015
4. Show, attend and tell: Neural image caption generation with visual attention.;Xu,2015
5. Knowing when to look: Adaptive attention via a visual sentinel for image captioning.;Xiong,2017