Affiliation:
1. University of Science and Technology of China, Hefei, China
Funder
National Natural Science Foundation of China
National Key Research and Development Project of China
Reference61 articles.
1. Peter Anderson Xiaodong He Chris Buehler Damien Teney Mark Johnson Stephen Gould and Lei Zhang. 2018. Bottom-up and top-down attention for image captioning and visual question answering. In CVPR. 6077--6086. Peter Anderson Xiaodong He Chris Buehler Damien Teney Mark Johnson Stephen Gould and Lei Zhang. 2018. Bottom-up and top-down attention for image captioning and visual question answering. In CVPR. 6077--6086.
2. Mikhail Belkin Siyuan Ma and Soumik Mandal. 2018. To understand deep learning we need to understand kernel learning. In ICML. PMLR 541--549. Mikhail Belkin Siyuan Ma and Soumik Mandal. 2018. To understand deep learning we need to understand kernel learning. In ICML. PMLR 541--549.
3. Hui Chen , Guiguang Ding , Xudong Liu , Zijia Lin , Ji Liu , and Jungong Han . 2020 b. Imram: Iterative matching with recurrent attention memory for cross-modal image-text retrieval. In CVPR. 12655--12663. Hui Chen, Guiguang Ding, Xudong Liu, Zijia Lin, Ji Liu, and Jungong Han. 2020b. Imram: Iterative matching with recurrent attention memory for cross-modal image-text retrieval. In CVPR. 12655--12663.
4. Jiacheng Chen Hexiang Hu Hao Wu Yuning Jiang and Changhu Wang. 2021. Learning the best pooling strategy for visual semantic embedding. In CVPR. 15789--15798. Jiacheng Chen Hexiang Hu Hao Wu Yuning Jiang and Changhu Wang. 2021. Learning the best pooling strategy for visual semantic embedding. In CVPR. 15789--15798.
5. Tianlang Chen Jiajun Deng and Jiebo Luo. 2020a. Adaptive offline quintuplet loss for image-text matching. In ECCV. 549--565. Tianlang Chen Jiajun Deng and Jiebo Luo. 2020a. Adaptive offline quintuplet loss for image-text matching. In ECCV. 549--565.