Funder
National Natural Science Foundation of China
Science and Technology Innovation Foundation of Dalian
Publisher
Springer Science and Business Media LLC
Subject
Computer Networks and Communications,Hardware and Architecture,Media Technology,Software
Reference59 articles.
1. Anderson P, He X, Buehler C, Teney D, Johnson M, Gould S, Zhang L (2018) Bottom-up and top-down attention for image captioning and visual question answering. In: Proceedings of the IEEE Conference on computer vision and pattern recognition, pp 6077–6086
2. Andrew G, Arora R, Bilmes J, Livescu K (2013) Deep canonical correlation analysis. In: Proceedings of the 30th International conference on machine learning, pp 1247–1255
3. Chen Y, Li L, Yu L, El Kholy A, Ahmed F, Gan Z, Cheng Y, Liu J (2020) Uniter: Universal image-text representation learning. In: Proceedings of the 16th European conference on computer vision, pp 104–120
4. Cheng M, Mitra NJ, Huang X, Torr PHS, Hu S (2015) Global contrast based salient region detection. IEEE Trans Pattern Anal Mach Intell 37 (3):569–582
5. Cheng Y, Zhu X, Qian J, Wen F, Liu P (2022) Cross-modal graph matching network for image-text retrieval. ACM Transactions on Multimedia Computing Communications, and Applications (TOMM) 18(4):1–23
Cited by
2 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献
1. SENSE DIFFERENTIATION OF TEXTS AS A COMPONENT OF NEURAL NETWORK MODELLING;Scientific Journal of National Pedagogical Dragomanov University. Series 9. Current Trends in Language Development;2024-06-30
2. Multi-task Collaborative Network for Image-Text Retrieval;Lecture Notes in Computer Science;2024