1. FloWaveNet: A generative flow for raw audio;Kim
2. Auto-encoding variational Bayes;Kingma
3. VARA-TTS: Non-autoregressive text-to-speech synthesis based on very deep VAE with residual attention;Liu,2021
4. High fidelity speech synthesis with adversarial networks;Bińkowski