Author:
Zhou Jian,Liu Jiahui,Fan Cunhang,Zheng Wenming,Lv Zhao,Tao Liang,Kwan Hon Keung
Reference51 articles.
1. Effective and direct control of neural tts prosody by removing interactions between different attributes;X An;Neural Networks,2021
2. Deep learning-based expressive speech synthesis: a systematic review of approaches, challenges, and resources;H Barakat;EURASIP JOURNAL ON AUDIO SPEECH AND MUSIC PROCESSING,2024
3. Fine-grained style control in transformerbased text-to-speech synthesis;L W Chen;ICASSP 2022 -2022 IEEE International Conference on Acoustics, Speech and Signal Processing,2022
4. Optimizing feature fusion for improved zero-shot adaptation in text-to-speech synthesis;Z Chen;EURASIP JOURNAL ON AUDIO SPEECH AND MUSIC PROCESS-ING,2024