1. Andrew G,Arora R,Bilmes J and Livescu K. 2013. Deep canonical correlation analysis//Proceedings of the 30th International Conference on Machine Learning. Atlanta,USA:JMLR.org:1247-1255
2. Arjovsky M,Chintala S and Bottou L. 2017. Wasserstein generative adversarial networks//Proceedings of the 34th International Conference on Machine Learning. Sydney,Australia:JMLR.org:214-223
3. Baltrušaitis T,Ahuja C and Morency L P. 2019. Multimodal machine learning:a survey and taxonomy. IEEE Transactions on Pattern Analysis and Machine Intelligence,41(2):423-443[DOI:10. 1109/TPAMI.2018.2798607]
4. Brown T B,Mann B,Ryder N,Subbiah M,Kaplan J,Dhariwal P,Neelakantan A, Shyam P, Sastry G, Askell A, Agarwal S,Herbert-Voss A,Krueger G,Henighan T,Child R,Ramesh A,Ziegler D M,Wu J,Winter C,Hesse C,Chen M,Sigler E,Litwin M,Gray S,Chess B,Clark J,Berner C,McCandlish S,Radford A,Sutskever I and Amodei D. 2020. Language models are fewshot learners//Proceedings of the 34th International Conference on Neural Information Processing Systems. Vancouver,Canada:Curran Associates Inc.:1877-1901
5. Cao Y,Long M S,Wang J M,Yang Q and Yu P S. 2016. Deep visualsemantic hashing for cross-modal retrieval//Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. San Francisco,USA:ACM:1445-1454[DOI,10.1145/2939672.2939812]