1. Nocaps: Novel object captioning at scale;Agrawal;In Proceedings ofthe IEEE/CVF International Conference on Computer Vision (pp,2019
2. Aker, A., & Gaižauskas, R. (2010). Generating image descriptions using dependency relational patterns. In Proceedings of the 48th annual meeting of the association for computational linguistics (pp. 1250-1258).
3. Bottom-up and top-down attention for image captioning and visual question answering;Anderson;In Proceedings ofthe IEEE Conference on Computer Vision and Pattern Recognition (pp,2018
4. Anwar, S., Hwang, K., & Sung, W. (2017). Structured pruning of deep convolutional neural networks. ACM Journal on Emerging Technologies in Computing Systems (JETC), 13(3): 1-18.
5. Bahdanau, D., Cho, K., & Bengio, Y. (2015). Neural machine translation by jointly learn- ing to align and translate. In Proceedings of the International Conference on Learning Representations.