1. PHO-LID: A Unified Model Incorporating Acoustic-Phonetic and Phonotactic Information for Language Identification
2. Voicemixer: Adversarial voice style mixup;Lee;Advances in Neural Information Processing Systems,2021
3. Montreal Forced Aligner: Trainable Text-Speech Alignment Using Kaldi
4. Fastspeech: Fast, robust and controllable text to speech;Ren;Advances in Neural Information Processing Systems,2019
5. Phoneme alignment based on discriminative learning