1. Style Tokens: Unsupervised Style Modeling, Control and Transfer in End-to-End Speech Synthesis;wang;Proc ICML,2018
2. Towards Endto-End Prosody Transfer for Expressive Speech Synthesis with Tacotron;skerry-ryan;Proc ICML,2018
3. Hierarchical Generative Modeling for Controllable Speech Synthesis;hsu;Proc ICLR,2019
4. Learning Latent Representations for Style Control and Transfer in End-to-end Speech Synthesis
5. Multimodal and Multilingual Embeddings for Large-Scale Speech Mining;duquenne;Proc NeruIPS,2021