1. Emotion Recognition in Speech using Cross-Modal Transfer in the Wild
2. Galen Andrew , Raman Arora , Jeff Bilmes , and Karen Livescu . 2013 . Deep canonical correlation analysis . In International conference on machine learning. 1247--1255 . Galen Andrew, Raman Arora, Jeff Bilmes, and Karen Livescu. 2013. Deep canonical correlation analysis. In International conference on machine learning. 1247--1255.
3. Yunjey Choi , Min-Je Choi , Munyoung Kim , Jung-Woo Ha , Sunghun Kim , and Jaegul Choo . 2018 . StarGAN: Unified Generative Adversarial Networks for Multi-Domain Image-to-Image Translation. In 2018 IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2018 , Salt Lake City, UT, USA, June 18--22 , 2018. 8789--8797. Yunjey Choi, Min-Je Choi, Munyoung Kim, Jung-Woo Ha, Sunghun Kim, and Jaegul Choo. 2018. StarGAN: Unified Generative Adversarial Networks for Multi-Domain Image-to-Image Translation. In 2018 IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2018, Salt Lake City, UT, USA, June 18--22, 2018. 8789--8797.
4. Yunjey Choi , Youngjung Uh , Jaejun Yoo , and Jung-Woo Ha. 2019. StarGAN v2: Diverse Image Synthesis for Multiple Domains. CoRR , Vol. abs/ 1912 .0 1865 (2019). Yunjey Choi, Youngjung Uh, Jaejun Yoo, and Jung-Woo Ha. 2019. StarGAN v2: Diverse Image Synthesis for Multiple Domains. CoRR, Vol. abs/1912.01865 (2019).
5. Joon Son Chung Arsha Nagrani and Andrew Zisserman. 2018. VoxCeleb2: Deep Speaker Recognition. In Interspeech. 1086--1090. Joon Son Chung Arsha Nagrani and Andrew Zisserman. 2018. VoxCeleb2: Deep Speaker Recognition. In Interspeech. 1086--1090.