Author:
Huang De-An, Buch Shyamal, Dery Lucio, Garg Animesh, Fei-Fei Li, Niebles Juan Carlos
Cited by
50 articles.
1. United We Stand, Divided We Fall: UnityGraph for Unsupervised Procedure Learning from Videos;2024 IEEE/CVF Winter Conference on Applications of Computer Vision (WACV);2024-01-03
2. Video Referring Expression Comprehension via Transformer with Content-conditioned Query;Proceedings of the 1st International Workshop on Deep Multimodal Learning for Information Retrieval;2023-10-29
3. Temporal Sentence Grounding in Videos: A Survey and Future Directions;IEEE Transactions on Pattern Analysis and Machine Intelligence;2023-08
4. Shot Retrieval and Assembly with Text Script for Video Montage Generation;Proceedings of the 2023 ACM International Conference on Multimedia Retrieval;2023-06-12
5. Meta-Personalizing Vision-Language Models to Find Named Instances in Video;2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR);2023-06