Distilling Vision-Language Pre-Training to Collaborate with Weakly-Supervised Temporal Action Localization-Reference-Cited by-同舟云学术

Distilling Vision-Language Pre-Training to Collaborate with Weakly-Supervised Temporal Action Localization

Published:2023-06 Issue: Volume: Page:
ISSN:
Container-title:2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
language:
Short-container-title:

Author:

Ju Chen¹,Zheng Kunhao¹,Liu Jinxiang¹,Zhao Peisen²,Zhang Ya¹,Chang Jianlong²,Tian Qi²,Wang Yanfeng¹

Affiliation:

1. CMIC, Shanghai Jiao Tong University

2. Huawei Cloud

Funder

National Key R&D Program of China

STCSM

111 plan

Publisher

IEEE

Link

Reference104 articles.

1. Image-to-word transformation based on dividing and vector quantizing images with words;mori;Proc of the ACM Int Conf on Multimedia,0

4. Temporal action detection with global segmentation mask learning;nag;Proc of Eur Conf Comput Vis,0

5. Weakly-supervised action localization with expectation-maximization multiinstance learning;luo;Proc of Eur Conf Comput Vis,0

Cited by 3 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

3. Temporal Action Localization in the Deep Learning Era: A Survey;IEEE Transactions on Pattern Analysis and Machine Intelligence;2023