1. Babytalk: understanding and generating simple image descriptions;Kulkarni;IEEE Trans. Pattern Anal. Mach. Intell.,2013
2. Show and tell: a neural image caption generator;Vinyals,2015
3. Show, attend and tell: neural image caption generation with visual attention;Xu,2015
4. Visual relationship detection with language priors;Lu,2016
5. Scene graph generation by iterative message passing;Xu,2017