1. Sequence to sequence learning with neural networks;Sutskever,2014
2. Sequence to sequence – video to text;Venugopalan,2015
3. Describing videos by exploiting temporal structure;Yao,2015
4. Hierarchical recurrent neural encoder for video representation with application to captioning;Pan,2016
5. Translating video content to natural language descriptions;Rohrbach,2013