1. METTS: Multilingual Emotional Text-to-Speech by Cross-Speaker and Cross-Lingual Emotion Transfer;IEEE/ACM Transactions on Audio, Speech, and Language Processing;2024
2. Accent-VITS: Accent Transfer for End-to-End TTS;Communications in Computer and Information Science;2024
3. Nonparallel Expressive TTS for Unseen Target Speaker using Style-Controlled Adaptive Layer and Optimized Pitch Embedding;2023 International Conference on Speech Technology and Human-Computer Dialogue (SpeD);2023-10-25
4. Prosody-Aware SpeechT5 for Expressive Neural TTS;ICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP);2023-06-04
5. Multi-Speaker Expressive Speech Synthesis via Multiple Factors Decoupling;ICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP);2023-06-04