A Straightforward Framework for Video Retrieval Using CLIP

Published:2021 Page:3-12
ISSN:0302-9743
Container-title:Lecture Notes in Computer Science

Author:

Portillo-Quintero Jesús Andrés, Ortiz-Bayliss José Carlos, Terashima-Marín Hugo

Publisher

Springer International Publishing

Link

https://link.springer.com/content/pdf/10.1007/978-3-030-77004-4_1

References: 19 articles.

1. Chen, D., Dolan, W.B.: Collecting highly parallel data for paraphrase evaluation. In: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies, pp. 190–200 (2011)

2. Chen, T., Kornblith, S., Norouzi, M., Hinton, G.: A simple framework for contrastive learning of visual representations. In: International Conference on Machine Learning, pp. 1597–1607. PMLR (2020)

3. Dong, J., Li, X., Snoek, C.G.M.: Predicting visual features from text for image and video caption retrieval. IEEE Trans. Multimed. 20(12), 3377–3388 (2018)

4. Dong, J., et al.: Dual encoding for zero-example video retrieval. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 9346–9355 (2019)

5. Gabeur, V.: In: Lecture Notes in Computer Science (2020)

Cited by 58 articles.

1. CLIP2TF: Multimodal video–text retrieval for adolescent education;Displays;2024-09

2. Lightweight hashing with contrastive self-cross XLNet and combinational similarity matching for content-based video retrieval;Australian Journal of Electrical and Electronics Engineering;2024-08-04

3. Improving semantic video retrieval models by training with a relevance-aware online mining strategy;Computer Vision and Image Understanding;2024-08

4. Building an Open-Vocabulary Video CLIP Model With Better Architectures, Optimization and Data;IEEE Transactions on Pattern Analysis and Machine Intelligence;2024-07

5. Multilevel Semantic Interaction Alignment for Video–Text Cross-Modal Retrieval;IEEE Transactions on Circuits and Systems for Video Technology;2024-07