1. Su, Shupeng, Zhisheng Zhong, and Chao Zhang. "Deep joint-semantics reconstructing hashing for large-scale unsupervised cross-modal retrieval." Proceedings of the IEEE/CVF international conference on computer vision. 2019.
2. Shao, Jie, et al. "3view deep canonical correlation analysis for cross-modal retrieval." 2015 Visual Communications and Image Processing (VCIP). IEEE, 2015.
3. "Heterogeneous community question answering via social-aware multi-modal co-attention convolutional matching;Hu Jun;IEEE Transactions on Multimedia,2020
4. Feng, Fangxiang, Xiaojie Wang, and Ruifan Li. "Cross-modal retrieval with correspondence autoencoder." Proceedings of the 22nd ACM international conference on Multimedia. 2014.
5. "Discriminative dictionary learning with common label alignment for cross-modal retrieval;Deng Cheng;IEEE Transactions on Multimedia,2015