1. The limits of the Mean Opinion Score for speech synthesis evaluation;Computer Speech & Language;2024-03
2. VoiceGrad: Non-Parallel Any-to-Many Voice Conversion With Annealed Langevin Dynamics;IEEE/ACM Transactions on Audio, Speech, and Language Processing;2024
3. DisC-VC: Disentangled and F0-Controllable Neural Voice Conversion;2023 Asia Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC);2023-10-31
4. W2VC: WavLM representation based one-shot voice conversion with gradient reversal distillation and CTC supervision;EURASIP Journal on Audio, Speech, and Music Processing;2023-10-28
5. A Recent Survey Paper on Text-To-Speech Systems;International Journal of Advanced Research in Science, Communication and Technology;2023-01-22