1. Is An Image Worth Five Sentences? A New Look into Semantics for Image-Text Matching
2. Generating Sentences from a Continuous Space
3. Shizhe Chen, Yida Zhao, Qin Jin, and Qi Wu. 2020b. Fine-Grained Video-Text Retrieval With Hierarchical Graph Reasoning. In IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR). 10635--10644.
4. Adaptive Offline Quintuplet Loss for Image-Text Matching
5. Sanghyuk Chun, Seong Joon Oh, Rafael Sampaio de Rezende, Yannis Kalantidis, and Diane Larlus. 2021. Probabilistic Embeddings for Cross-Modal Retrieval. In IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).