1. Differential attention for visual question answering;Patro;IEEE/CVF Conference on Computer Vision and Pattern Recognition,2018
2. Show, attend and tell: Neural image caption generation with visual attention;Xu,2015
3. Sca-cnn: Spatial and channel-wise attention in convolutional networks for image captioning;Chen;IEEE Conference on Computer Vision and Pattern Recognition (CVPR),2017
4. Show and tell: A neural image caption generator;Vinyals;IEEE Conference on Computer Vision and Pattern Recognition (CVPR),2015
5. Bottom-up and top-down attention for image captioning and visual question answering;Anderson;2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition,2018