1. Style tokens: Unsupervised style modeling, control and transfer in end-to-end speech synthesis;Wang;International Conference on Machine Learning (ICML),2018
2. FastSpeech 2: Fast and high-quality end-to-end text to speech;Ren;International Conference on Learning Representations (ICLR),2021
3. Token-Level Ensemble Distillation for Grapheme-to-Phoneme Conversion
4. BERT: Pre-training of deep bidirectional transformers for language understanding;Devlin;Annual Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (NAACL),2019
5. An objective measure for estimating MOS of synthesized speech;Min;Conference of the International Speech Communication Association (InterSpeech),2001