1. CopyCat: Many-to-Many Fine-Grained Prosody Transfer for Neural Text-to-Speech
2. wav2vec 2.0: A framework for self-supervised learning of speech representations;Baevski;Advances in neural information processing systems,2020
3. fairseq: A Fast, Extensible Toolkit for Sequence Modeling
4. High fidelity speech synthesis with adversarial networks;Bińkowski;International Conference on Learning Representations,2020
5. Neural discrete representation learning;van den Oord;Advances in neural information processing systems,2017