Query-aware Long Video Localization and Relation Discrimination for Deep Video Understanding-Reference-Cited by-同舟云学术

Query-aware Long Video Localization and Relation Discrimination for Deep Video Understanding

Published:2023-10-26 Issue: Volume: Page:
ISSN:
Container-title:Proceedings of the 31st ACM International Conference on Multimedia
language:
Short-container-title:

Author:

Xu Yuanxing¹^ORCID,Wei Yuting¹^ORCID,Wu Bin¹^ORCID

Affiliation:

1. Beijing University of Posts and Telecommunications, Beijing, China

Funder

NSFC-General Technology Basic Research Joint Funds

National Natural Science Foundation of China

Publisher

ACM

Link

https://dl.acm.org/doi/pdf/10.1145/3581783.3612871

Reference32 articles.

1. TallFormer: Temporal Action Localization with a Long-Memory Transformer

2. Alexey Dosovitskiy Lucas Beyer Alexander Kolesnikov Dirk Weissenborn Xiaohua Zhai Thomas Unterthiner Mostafa Dehghani Matthias Minderer Georg Heigold Sylvain Gelly etal 2020. An image is worth 16x16 words: Transformers for image recognition at scale. arXiv preprint arXiv:2010.11929 (2020). Alexey Dosovitskiy Lucas Beyer Alexander Kolesnikov Dirk Weissenborn Xiaohua Zhai Thomas Unterthiner Mostafa Dehghani Matthias Minderer Georg Heigold Sylvain Gelly et al. 2020. An image is worth 16x16 words: Transformers for image recognition at scale. arXiv preprint arXiv:2010.11929 (2020).

3. Danny Driess , Fei Xia , Mehdi SM Sajjadi , Corey Lynch, Aakanksha Chowdhery, Brian Ichter, Ayzaan Wahid, Jonathan Tompson, Quan Vuong, Tianhe Yu, et al. 2023 . Palm-e : An embodied multimodal language model. arXiv preprint arXiv:2303.03378 (2023). Danny Driess, Fei Xia, Mehdi SM Sajjadi, Corey Lynch, Aakanksha Chowdhery, Brian Ichter, Ayzaan Wahid, Jonathan Tompson, Quan Vuong, Tianhe Yu, et al. 2023. Palm-e: An embodied multimodal language model. arXiv preprint arXiv:2303.03378 (2023).

4. Env-QA: A Video Question Answering Benchmark for Comprehensive Understanding of Dynamic Environments

5. Motion-Appearance Co-memory Networks for Video Question Answering