1. TextRS: Deep Bidirectional Triplet Network for Matching Text to Remote Sensing Images;Rahhal A.;Remote Sensing,2020
2. Hui Chen , Guiguang Ding , Xudong Liu , Zijia Lin , Ji Liu , and Jungong Han . 2020 . IMRAM: Iterative matching with recurrent attention memory for cross-modal image-text retrieval . In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition. Computer Vision Foundation / IEEE, 12655–12663 . Hui Chen, Guiguang Ding, Xudong Liu, Zijia Lin, Ji Liu, and Jungong Han. 2020. IMRAM: Iterative matching with recurrent attention memory for cross-modal image-text retrieval. In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition. Computer Vision Foundation / IEEE, 12655–12663.
3. A Deep Semantic Alignment Network for the Cross-Modal Image-Text Retrieval in Remote Sensing
4. Haiwen Diao , Ying Zhang , Lin Ma , and Huchuan Lu . 2021 . Similarity reasoning and filtration for image-text matching . In Proceedings of the AAAI conference on artificial intelligence. AAAI Press, 1218–1226 . Haiwen Diao, Ying Zhang, Lin Ma, and Huchuan Lu. 2021. Similarity reasoning and filtration for image-text matching. In Proceedings of the AAAI conference on artificial intelligence. AAAI Press, 1218–1226.
5. Fartash Faghri , David J. Fleet , Jamie Ryan Kiros , and Sanja Fidler . 2018 . VSE++: Improving Visual-Semantic Embeddings with Hard Negatives . In British Machine Vision Conference 2018. BMVA Press, 12. Fartash Faghri, David J. Fleet, Jamie Ryan Kiros, and Sanja Fidler. 2018. VSE++: Improving Visual-Semantic Embeddings with Hard Negatives. In British Machine Vision Conference 2018. BMVA Press, 12.