From Representation to Reasoning: Towards both Evidence and Commonsense Reasoning for Video Question-Answering

Published: 2022-06
Container-title: 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)

Author:

Li Jiangtong¹, Niu Li¹, Zhang Liqing¹

Affiliation:

1. MoE Key Lab of Artificial Intelligence, Shanghai Jiao Tong University, Department of Computer Science and Engineering

Publisher

IEEE

References: 57 articles.

2. Temporal aggregate representations for long-range video understanding;Sener;ECCV 2020, 2020

4. A dataset and exploration of models for understanding video data through fill-in-the-blank question-answering;Tegan;CVPR 2017, 2017

Cited by 16 articles.

3. Contrastive Video Question Answering via Video Graph Transformer;IEEE Transactions on Pattern Analysis and Machine Intelligence;2023-11-01

4. ATM: Action Temporality Modeling for Video Question Answering;Proceedings of the 31st ACM International Conference on Multimedia;2023-10-26

5. Redundancy-aware Transformer for Video Question Answering;Proceedings of the 31st ACM International Conference on Multimedia;2023-10-26