1. Deep visual-semantic alignments for generating image descriptions;A Karpathy;IEEE Conference on Computer Vision and Pattern Recognition (CVPR),2015
2. Contextually customized video summaries via natural language;J Choi;IEEE Winter Conference on Applications of Computer Vision (WACV),2018
3. Look before you leap: Bridging model-free and model-based reinforcement learning for planned-ahead vision-and-language navigation;X Wang;European Conference on Computer Vision (ECCV),2018
4. Textual explanations for self-driving vehicles;J Kim;European Conference on Computer Vision (ECCV),2018
5. Bottom-up and topdown attention for image captioning and vqa;P Anderson;IEEE Conference on Computer Vision and Pattern Recognition (CVPR),2018