1. Composing simple image descriptions using web-scale n-grams;Li
2. Im2Text: Describing Images Using 1 Million Captioned Photographs;Ordonez;Neural Information Processing Systems,2011
3. Show and Tell: A Neural Image Caption Generator;Vinyals;IEEE,2015
4. Show, Attend and Tell: Neural Image Caption Generation with Visual Attention;Xu;Computer Science,2015
5. Knowing When to Look: Adaptive Attention via a Visual Sentinel for Image Captioning