Rethinking the Reference-based Distinctive Image Captioning-Reference-Cited by-同舟云学术

Rethinking the Reference-based Distinctive Image Captioning

Published:2022-10-10 Issue: Volume: Page:
ISSN:
Container-title:Proceedings of the 30th ACM International Conference on Multimedia
language:
Short-container-title:

Author:

Mao Yangjun¹,Chen Long²,Jiang Zhihong¹,Zhang Dong³,Zhang Zhimeng¹,Shao Jian¹,Xiao Jun¹

Affiliation:

1. Zhejiang University, Hangzhou, China

2. Columbia University, New York, NY, USA

3. Hong Kong University of Science and Technology, Hong Kong, Hong Kong

Funder

National Key Research and Development Program of China

National Natural Science Foundation of China

Zhejiang Natural Science Foundation

Fundamental Research Funds for the Central Universities

Zhejiang Innovation Foundation

Publisher

ACM

Link

https://dl.acm.org/doi/pdf/10.1145/3503161.3548358

Reference51 articles.

1. Peter Anderson Xiaodong He Chris Buehler Damien Teney Mark Johnson Stephen Gould and Lei Zhang. 2018. Bottom-up and top-down attention for image captioning and visual question answering. In CVPR. Peter Anderson Xiaodong He Chris Buehler Damien Teney Mark Johnson Stephen Gould and Lei Zhang. 2018. Bottom-up and top-down attention for image captioning and visual question answering. In CVPR.

2. J. L. Ba J. R. Kiros and G. E. Hinton. 2016. Layer Normalization. (2016). J. L. Ba J. R. Kiros and G. E. Hinton. 2016. Layer Normalization. (2016).

3. Satanjeev Banerjee and Alon Lavie . 2005 . METEOR: An automatic metric for MT evaluation with improved correlation with human judgments . In ACL workshop. Satanjeev Banerjee and Alon Lavie. 2005. METEOR: An automatic metric for MT evaluation with improved correlation with human judgments. In ACL workshop.

4. Fuhai Chen , Rongrong Ji , Xiaoshuai Sun , Yongjian Wu , and Jinsong Su . 2018 . Groupcap: Group-based image captioning with structured relevance and diversity constraints. In CVPR. 1345--1353. Fuhai Chen, Rongrong Ji, Xiaoshuai Sun, Yongjian Wu, and Jinsong Su. 2018. Groupcap: Group-based image captioning with structured relevance and diversity constraints. In CVPR. 1345--1353.

5. Human-like Controllable Image Captioning with Verb-specific Semantic Roles

Cited by 14 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Learning Combinatorial Prompts for Universal Controllable Image Captioning;International Journal of Computer Vision;2024-07-22

2. Multi-Source Dynamic Interactive Network Collaborative Reasoning Image Captioning;ICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP);2024-04-14

3. Visual contextual relationship augmented transformer for image captioning;Applied Intelligence;2024-03

4. Weakly supervised grounded image captioning with semantic matching;Applied Intelligence;2024-03

5. A comprehensive literature review on image captioning methods and metrics based on deep learning technique;Multimedia Tools and Applications;2024-02-20