1. Show and tell: a neural image caption generator;Vinyals,2015
2. Long-term recurrent convolutional networks for visual recognition and description;Donahue,2015
3. Sequence to sequence - video to text;Venugopalan,2015
4. Learning like a child: fast novel visual concept learning from sentence descriptions of images;Mao,2015
5. Deep visual-semantic alignments for generating image descriptions;Karpathy,2015