1. Show, attend and tell: neural image caption generation with visual attention;Xu,2015
2. Image captioning with semantic attention;You,2016
3. Knowing when to look: adaptive attention via a visual sentinel for image captioning;Lu,2017
4. Show, observe and tell: attribute-driven attention model for image captioning;Chen,2018
5. 3G structure for image caption generation;Yuan;Neurocomputing,2019