1. Cross-modal subspace learning via pairwise constraints;He;IEEE TIP,2015
2. CCL: Cross-modal correlation learning with multigrained fusion by hierarchical network;Peng;IEEE TMM,2017
3. H. Wang, D. Sahoo, C. Liu, E.-p. Lim, S.C. Hoi, Learning cross-modal embeddings with adversarial networks for cooking recipes and food images, in: IEEE CVPR, 2019, pp. 11572–11581.
4. H. Alwassel, D. Mahajan, B. Korbar, L. Torresani, B. Ghanem, D. Tran, Self-supervised learning by cross-modal audio-video clustering, in: NeurIPS, Vol. 33, 2020, pp. 9758–9770.
5. M.H. Coen, Cross-modal clustering, in: AAAI, 2005, pp. 932–937.