Equivariant and Invariant Grounding for Video Question Answering

Author:

Li Yicong1,Wang Xiang2,Xiao Junbin1,Chua Tat-Seng1

Affiliation:

1. National University of Singapore, Singapore, Singapore

2. University of Science and Technology of China, Hefei, Singapore

Publisher

ACM

Reference54 articles.

1. Peter Anderson , Qi Wu , Damien Teney , Jake Bruce , Mark Johnson , Niko Sünderhauf , Ian D. Reid , Stephen Gould , and Anton van den Hengel . 2018. Vision-and- Language Navigation: Interpreting Visually-Grounded Navigation Instructions in Real Environments . In IEEE CVPR. 3674--3683. Peter Anderson, Qi Wu, Damien Teney, Jake Bruce, Mark Johnson, Niko Sünderhauf, Ian D. Reid, Stephen Gould, and Anton van den Hengel. 2018. Vision-and- Language Navigation: Interpreting Visually-Grounded Navigation Instructions in Real Environments. In IEEE CVPR. 3674--3683.

2. Martín Arjovsky Léon Bottou Ishaan Gulrajani and David Lopez-Paz. 2019. Invariant Risk Minimization. Martín Arjovsky Léon Bottou Ishaan Gulrajani and David Lopez-Paz. 2019. Invariant Risk Minimization.

3. Chaofan Chen , Oscar Li , Alina Barnett , Jonathan Su , and Cynthia Rudin . 2018. This looks like that: deep learning for interpretable image recognition. CoRR ( 2018 ). Chaofan Chen, Oscar Li, Alina Barnett, Jonathan Su, and Cynthia Rudin. 2018. This looks like that: deep learning for interpretable image recognition. CoRR (2018).

4. Long Chen , Xin Yan , Jun Xiao , Hanwang Zhang , Shiliang Pu , and Yueting Zhuang . 2020. Counterfactual Samples Synthesizing for Robust Visual Question Answering . In IEEE CVPR. 10797--10806. Long Chen, Xin Yan, Jun Xiao, Hanwang Zhang, Shiliang Pu, and Yueting Zhuang. 2020. Counterfactual Samples Synthesizing for Robust Visual Question Answering. In IEEE CVPR. 10797--10806.

5. Elliot Creager , Jörn-Henrik Jacobsen , and Richard S . Zemel . 2021 . Environment Inference for Invariant Learning. In ICML (Proceedings of Machine Learning Research) . 2189--2200. Elliot Creager, Jörn-Henrik Jacobsen, and Richard S. Zemel. 2021. Environment Inference for Invariant Learning. In ICML (Proceedings of Machine Learning Research). 2189--2200.

Cited by 7 articles. 订阅此论文施引文献 订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献

1. Contrastive Video Question Answering via Video Graph Transformer;IEEE Transactions on Pattern Analysis and Machine Intelligence;2023-11-01

2. ATM: Action Temporality Modeling for Video Question Answering;Proceedings of the 31st ACM International Conference on Multimedia;2023-10-26

3. Mixup-Augmented Temporally Debiased Video Grounding with Content-Location Disentanglement;Proceedings of the 31st ACM International Conference on Multimedia;2023-10-26

4. Visual Causal Scene Refinement for Video Question Answering;Proceedings of the 31st ACM International Conference on Multimedia;2023-10-26

5. Discovering Spatio-Temporal Rationales for Video Question Answering;2023 IEEE/CVF International Conference on Computer Vision (ICCV);2023-10-01

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3