NaQ: Leveraging Narrations as Queries to Supervise Episodic Memory

Published: 2023-06
Container-title: 2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)

Author:

Ramakrishnan Santhosh Kumar¹, Al-Halah Ziad², Grauman Kristen¹

Affiliation:

1. UT Austin

2. University of Utah

Publisher

IEEE

Link

http://xplorestaging.ieee.org/ielx7/10203037/10203050/10204810.pdf?arnumber=10204810

References: 42 articles.

1. Rolling-Unrolling LSTMs for Action Anticipation from First-Person Video

2. VLM: Task-agnostic video-language model pre-training for video understanding;Hu;Findings of the Association for Computational Linguistics: ACL-IJCNLP 2021

3. SlowFast Networks for Video Recognition

4. Video question answering via gradually refined attention over appearance and motion;Dejing;Proceedings of the 25th ACM International Conference on Multimedia

5. Anticipative Video Transformer

Cited by 5 articles.

1. An Outlook into the Future of Egocentric Vision;International Journal of Computer Vision;2024-05-28

2. Helping Hands: An Object-Aware Ego-Centric Video Recognition Model;2023 IEEE/CVF International Conference on Computer Vision (ICCV);2023-10-01

3. UniVTG: Towards Unified Video-Language Temporal Grounding;2023 IEEE/CVF International Conference on Computer Vision (ICCV);2023-10-01

4. HierVL: Learning Hierarchical Video-Language Embeddings;2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR);2023-06

5. Time-Aware Circulant Matrices for Question-Based Temporal Localization;Image Analysis and Processing – ICIAP 2023;2023