1. VQA: visual question answering;Antol;Int. J. Comput. Vis.,2017
2. Learning multilayer channel features for pedestrian detection;Cao;IEEE Trans. Image Process.,2017
3. Temporal-difference learning with sampling baseline for image captioning;Chen,2018
4. Empirical evaluation of gated recurrent neural networks on sequence modeling;Chung;arXiv:1412.3555v1,2014
5. Pedestrian attribute recognition at far distance;Deng,2014