1. MRMI-TTS: Multi-Reference Audios and Mutual Information Driven Zero-Shot Voice Cloning;ACM Transactions on Asian and Low-Resource Language Information Processing;2024-05-10
2. MELS-TTS: Multi-Emotion Multi-Lingual Multi-Speaker Text-To-Speech System Via Disentangled Style Tokens;ICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP);2024-04-14
3. METTS: Multilingual Emotional Text-to-Speech by Cross-Speaker and Cross-Lingual Emotion Transfer;IEEE/ACM Transactions on Audio, Speech, and Language Processing;2024
4. DQR-TTS: Semi-supervised Text-to-speech Synthesis with Dynamic Quantized Representation;2023 IEEE Intl Conf on Parallel & Distributed Processing with Applications, Big Data & Cloud Computing, Sustainable Computing & Communications, Social Computing & Networking (ISPA/BDCloud/SocialCom/SustainCom);2023-12-21
5. Cross-Lingual Knowledge Distillation via Flow-Based Voice Conversion for Robust Polyglot Text-to-Speech;Communications in Computer and Information Science;2023-11-13