Authors:
Lee Seung Hyun, Chi Hyung-gun, Oh Gyeongrok, Byeon Wonmin, Yoon Sang Ho, Park Hyunje, Cho Wonjun, Kim Jinkyu, Kim Sangpil
Funders:
National Research Council of Science and Technology
Defense Acquisition Program Administration
Korea Creative Content Agency
Institute for Information Communication Technology Planning and Evaluation
References (51 articles):
1. Abdal, R., Qin, Y., & Wonka, P. (2019). Image2stylegan: How to embed images into the stylegan latent space? In Proceedings of the IEEE/CVF international conference on computer vision (pp. 4432–4441).
2. Alayrac et al. (2020). Self-supervised MultiModal versatile networks. In NeurIPS.
3. Aytar et al. (2017). See, hear, and read: Deep aligned representations.
4. Brouwer, H. (2020). Audio-reactive latent interpolations with StyleGAN. In NeurIPS 2020 workshop on machine learning for creativity and design.
5. Chen, L., Srivastava, S., Duan, Z., & Xu, C. (2017). Deep cross-modal audio-visual generation. In Proceedings of the on thematic workshops of ACM multimedia 2017 (pp. 349–357).