1. Gradient-based learning applied to document recognition;LeCun;Proc. IEEE,1998
2. I. Sutskever, J. Martens, G.E. Hinton, Generating text with recurrent neural networks, in: ICML, 2011.
3. Long short-term memory;Hochreiter;Neural Comput.,1997
4. A Comprehensive Survey of Deep Learning for Image Captioning;Hossain;ACM Comput. Surv.,2019
5. K. Xu, J. Ba, R. Kiros, K. Cho, A. Courville, R. Salakhudinov, R. Zemel, Y. Bengio, Show, attend and tell: Neural image caption generation with visual attention, in: International conference on machine learning, 2015, pp. 2048–2057.