1. Visual Semantic Search: Retrieving Videos via Complex Textual Queries
2. Use what you have: Video retrieval using representations from collaborative experts;liu,2019
3. Univl: A unified video and language pre-training model for multimodal understanding and generation;luo,2020
4. Clip4clip: An empirical study of clip for end to end video clip retrieval;luo,2021
5. Efficient estimation of word representations in vector space;mikolov,2013