Unlocking the Power of Cross-Dimensional Semantic Dependency for Image-Text Matching

Published: 2023-10-26
Container-title: Proceedings of the 31st ACM International Conference on Multimedia

Author:

Zhang Kun¹, Zhang Lei¹, Hu Bo¹, Zhu Mengxiao¹, Mao Zhendong¹

Affiliation:

1. University of Science and Technology of China, Hefei, China

Funder

National Natural Science Foundation of China

National Key Research and Development Project of China

Publisher

ACM

Link

https://dl.acm.org/doi/pdf/10.1145/3581783.3611703

References: 61 articles.

1. Peter Anderson, Xiaodong He, Chris Buehler, Damien Teney, Mark Johnson, Stephen Gould, and Lei Zhang. 2018. Bottom-up and top-down attention for image captioning and visual question answering. In CVPR. 6077--6086.

2. Mikhail Belkin, Siyuan Ma, and Soumik Mandal. 2018. To understand deep learning we need to understand kernel learning. In ICML. PMLR, 541--549.

3. Hui Chen, Guiguang Ding, Xudong Liu, Zijia Lin, Ji Liu, and Jungong Han. 2020b. Imram: Iterative matching with recurrent attention memory for cross-modal image-text retrieval. In CVPR. 12655--12663.

4. Jiacheng Chen, Hexiang Hu, Hao Wu, Yuning Jiang, and Changhu Wang. 2021. Learning the best pooling strategy for visual semantic embedding. In CVPR. 15789--15798.

5. Tianlang Chen, Jiajun Deng, and Jiebo Luo. 2020a. Adaptive offline quintuplet loss for image-text matching. In ECCV. 549--565.