1. Wang B, Yang Y, Xu X, Hanjalic A, Shen HT (2017) Adversarial cross-modal retrieval. In: Proceedings of the 25th ACM international conference on multimedia, pp 154–162
2. Faghri F, Fleet DJ, Kiros JR, Fidler S (2017) VSE++: Improving visual-semantic embeddings with hard negatives. arXiv preprint arXiv:1707.05612
3. Dutton B (2020) Adversarial canonical correlation analysis. arXiv preprint arXiv:2005.10349
4. Kiros R, Salakhutdinov R, Zemel RS (2014) Unifying visual-semantic embeddings with multimodal neural language models. arXiv preprint arXiv:1411.2539
5. Simonyan K, Zisserman A (2014) Very deep convolutional networks for large-scale image recognition. arXiv preprint arXiv:1409.1556