1. Watch your step: Learning node embeddings via graph attention;Abu-El-Haija Sami;Advances in Neural Information Processing Systems (NeurIPS’18),2018
2. Peter Anderson, Xiaodong He, Chris Buehler, Damien Teney, Mark Johnson, Stephen Gould, and Lei Zhang. 2018. Bottom-up and top-down attention for image captioning and visual question answering. In IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR’18). 6077–6086.
3. Gedas Bertasius, Lorenzo Torresani, Stella X. Yu, and Jianbo Shi. 2017. Convolutional random walk networks for semantic image segmentation. In IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR’17). 858–866.
4. Remi Cadene, Hedi Ben-Younes, Matthieu Cord, and Nicolas Thome. 2019. Murel: Multimodal relational reasoning for visual question answering. In IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR’19). 1989–1998.
5. Chunshui Cao, Xianming Liu, Yi Yang, Yinan Yu, Jiang Wang, Zilei Wang, Yongzhen Huang, Liang Wang, Chang Huang, Wei Xu, et al. 2015. Look and think twice: Capturing top-down visual attention with feedback convolutional neural networks. In IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR’15). 2956–2964.