1. Analyzing the Behavior of Visual Question Answering Models
2. Peter Anderson , Basura Fernando , Mark Johnson , and Stephen Gould . 2016 . SPICE: Semantic Propositional Image Caption Evaluation. In ECCV. Peter Anderson, Basura Fernando, Mark Johnson, and Stephen Gould. 2016. SPICE: Semantic Propositional Image Caption Evaluation. In ECCV.
3. Peter Anderson , Xiaodong He , Chris Buehler , Damien Teney , Mark Johnson , Stephen Gould , and Lei Zhang . 2018 . Bottom-Up and Top-Down Attention for Image Captioning and Visual Question Answering. In The IEEE Conference on Computer Vision and Pattern Recognition (CVPR). Peter Anderson, Xiaodong He, Chris Buehler, Damien Teney, Mark Johnson, Stephen Gould, and Lei Zhang. 2018. Bottom-Up and Top-Down Attention for Image Captioning and Visual Question Answering. In The IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
4. J. Aneja , A. Deshpande , and A. G. Schwing . 2018 . Convolutional Image Captioning. In 2018 IEEE Conference on Computer Vision and Pattern Recognition. J. Aneja, A. Deshpande, and A. G. Schwing. 2018. Convolutional Image Captioning. In 2018 IEEE Conference on Computer Vision and Pattern Recognition.
5. Probing the Need for Visual Context in Multimodal Machine Translation