1. Max Bain, Arsha Nagrani, Gül Varol, and Andrew Zisserman. 2021. Frozen in Time: A Joint Video and Image Encoder for End-to-End Retrieval. arXiv preprint arXiv:2104.00650 (2021).
2. Max Bain, Arsha Nagrani, Gül Varol, and Andrew Zisserman. 2022. A CLIP-Hitchhiker's Guide to Long Video Retrieval. arXiv preprint arXiv:2205.08508 (2022).
3. Quentin Berthet et al. 2020. Learning with Differentiable Perturbed Optimizers. Advances in Neural Information Processing Systems (2020).
4. Cross Modal Retrieval with Querybank Normalisation
5. Revisiting the “Video” in Video-Language Understanding