Author:
Velda Vania,Immanuel Steve Andreas,Hendria Willy Fitra,Jeong Cheol
Funder
Division of Human Resource Development
Ministry of Science, ICT and Future Planning
Institute for Information and Communications Technology Promotion
Subject
Computer Vision and Pattern Recognition,Signal Processing
Reference48 articles.
1. S. Venugopalan, M. Rohrbach, J. Donahue, R. Mooney, T. Darrell, K. Saenko, Sequence to sequence – video to text, in: Proc. ICCV, 2015, pp. 4534–4542.
2. avtmNet: Adaptive Visual-Text Merging Network for Image Captioning;Song;Comput. Electr. Eng.,2020
3. ArCo: Attention-reinforced transformer with contrastive learning for image captioning;Wang;Image Vis. Comput.,2022
4. Contrastive learning for image captioning;Dai,2017
5. R. Luo, B. Price, S. Cohen, G. Shakhnarovich, Discriminability objective for training descriptive captions, in: Proc. CVPR, 2018, pp. 6964–6974.
Cited by
3 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献