Recurrent convolutional video captioning with global and local attention

Authors

Jin Tao, Li Yingming, Zhang Zhongfei

Funder

National Natural Science Foundation of China

Zhejiang Lab

Central Universities in China

Publisher

Elsevier BV

Subject

Artificial Intelligence, Cognitive Neuroscience, Computer Science Applications

References: 44 articles.

1. Sequence to sequence - video to text; Venugopalan, 2015

2. Video paragraph captioning using hierarchical recurrent neural networks; Yu, 2016

3. Video description generation incorporating spatio-temporal features and a soft-attention mechanism; Yao, 2015

4. Attention-based multimodal fusion for video description; Hori, 2017

5. Spatio-temporal attention models for grounded video captioning; Zanfir, 2016

Cited by 23 articles.

2. Rethinking Missing Modality Learning from a Decoding Perspective; Proceedings of the 31st ACM International Conference on Multimedia; 2023-10-26

4. Exploring Group Video Captioning with Efficient Relational Approximation; 2023 IEEE/CVF International Conference on Computer Vision (ICCV); 2023-10-01