1. Controllable Prosody Generation with Partial Inputs;ICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP);2024-04-14
2. Deep learning-based expressive speech synthesis: a systematic review of approaches, challenges, and resources;EURASIP Journal on Audio, Speech, and Music Processing;2024-02-12
3. Invert-Classify: Recovering Discrete Prosody Inputs for Text-To-Speech;2023 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU);2023-12-16
4. HIGNN-TTS: Hierarchical Prosody Modeling With Graph Neural Networks for Expressive Long-Form TTS;2023 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU);2023-12-16
5. Fine-Grained Style Control in VITS-Based Text-to-Speech Synthesis;Computer Applications;2023-12-14