Dual-Stream Multimodal Learning for Topic-Adaptive Video Highlight Detection-Reference-Cited by-同舟云学术

Dual-Stream Multimodal Learning for Topic-Adaptive Video Highlight Detection

Published:2023-06-12 Issue: Volume: Page:
ISSN:
Container-title:Proceedings of the 2023 ACM International Conference on Multimedia Retrieval
language:
Short-container-title:

Author:

Xiong Ziwei¹^ORCID,Wang Han¹^ORCID

Affiliation:

1. Beijing Forestry University, China

Publisher

ACM

Link

https://dl.acm.org/doi/pdf/10.1145/3591106.3592286

Reference46 articles.

1. Joint Visual and Audio Learning for Video Highlight Detection

2. Contrastive Learning for Unsupervised Video Highlight Detection

3. Tom Brown , Benjamin Mann , Nick Ryder , Melanie Subbiah , Jared D Kaplan , Prafulla Dhariwal , Arvind Neelakantan , Pranav Shyam , Girish Sastry , Amanda Askell , 2020. Language models are few-shot learners. Advances in neural information processing systems 33 ( 2020 ), 1877–1901. Tom Brown, Benjamin Mann, Nick Ryder, Melanie Subbiah, Jared D Kaplan, Prafulla Dhariwal, Arvind Neelakantan, Pranav Shyam, Girish Sastry, Amanda Askell, 2020. Language models are few-shot learners. Advances in neural information processing systems 33 (2020), 1877–1901.

4. Ting Chen , Simon Kornblith , Mohammad Norouzi , and Geoffrey Hinton . 2020 . A simple framework for contrastive learning of visual representations . In International conference on machine learning. PMLR, 1597–1607 . Ting Chen, Simon Kornblith, Mohammad Norouzi, and Geoffrey Hinton. 2020. A simple framework for contrastive learning of visual representations. In International conference on machine learning. PMLR, 1597–1607.

5. Peng Gao , Shijie Geng , Renrui Zhang , Teli Ma , Rongyao Fang , Yongfeng Zhang , Hongsheng Li , and Yu Qiao . 2021 . Clip-adapter: Better vision-language models with feature adapters. arXiv preprint arXiv:2110.04544 (2021). Peng Gao, Shijie Geng, Renrui Zhang, Teli Ma, Rongyao Fang, Yongfeng Zhang, Hongsheng Li, and Yu Qiao. 2021. Clip-adapter: Better vision-language models with feature adapters. arXiv preprint arXiv:2110.04544 (2021).

Cited by 1 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. IMF-MF: Interactive moment localization with adaptive multimodal fusion and self-attention;Journal of Intelligent & Fuzzy Systems;2024-04-04