Revisiting the “Video” in Video-Language Understanding-Reference-Cited by-同舟云学术

Revisiting the “Video” in Video-Language Understanding

Published:2022-06 Issue: Volume: Page:
ISSN:
Container-title:2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
language:
Short-container-title:

Author:

Buch Shyamal¹,Eyzaguirre Cristobal¹,Gaidon Adrien²,Wu Jiajun¹,Fei-Fei Li¹,Niebles Juan Carlos¹

Affiliation:

1. Stanford University

2. Toyota Research Institute

Funder

Toyota Research Institute

Samsung

Salesforce

Office of Naval Research

Publisher

IEEE

Link

Reference62 articles.

1. Learning spatiotemporal representation with pseudo-3d residual networks;qiu;CVPR,2017

2. An information divergence measure between neural text and human text;pillutla;NeurIPS,2021

4. Roberta: A robustly optimized bert pretraining approach;liu;ArXiv Preprint,2019

Cited by 23 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

2. Contrastive Video Question Answering via Video Graph Transformer;IEEE Transactions on Pattern Analysis and Machine Intelligence;2023-11-01

3. Language-Guided Visual Aggregation Network for Video Question Answering;Proceedings of the 31st ACM International Conference on Multimedia;2023-10-26

4. Redundancy-aware Transformer for Video Question Answering;Proceedings of the 31st ACM International Conference on Multimedia;2023-10-26

5. ATM: Action Temporality Modeling for Video Question Answering;Proceedings of the 31st ACM International Conference on Multimedia;2023-10-26