Controllable speech synthesis by learning discrete phoneme-level prosodic representations-Reference-Cited by-同舟云学术

Controllable speech synthesis by learning discrete phoneme-level prosodic representations

Published:2023-01 Issue: Volume:146 Page:22-31
ISSN:0167-6393
Container-title:Speech Communication
language:en
Short-container-title:Speech Communication

Author:

Ellinas Nikolaos,Christidou Myrsini,Vioni Alexandra,Sung June Sig,Chalamandaris Aimilios,Tsiakoulis Pirros,Mastorocostas Paris

Publisher

Elsevier BV

Subject

Computer Science Applications,Computer Vision and Pattern Recognition,Linguistics and Language,Language and Linguistics,Communication,Modeling and Simulation,Software

Reference65 articles.

1. Angelini, O., Moinet, A., Yanagisawa, K., Drugman, T., 2020. Singing Synthesis: With a Little Help from my Attention. In: Proc. Interspeech.

2. Baevski, A., Zhou, Y., Mohamed, A., Auli, M., 2020. wav2vec 2.0: A Framework for Self-Supervised Learning of Speech Representations. In: Proc. NeurIPS. Vol. 33.

3. Bak, T., Bae, J.-S., Bae, H., Kim, Y.-I., Cho, H.-Y., 2021. FastPitchFormant: Source-Filter Based Decomposed Modeling for Speech Synthesis. In: Proc. Interspeech. pp. 116–120.

4. Effective use of variational embedding capacity in expressive end-to-end speech synthesis;Battenberg,2019

5. Location-relative attention mechanisms for robust long-form speech synthesis;Battenberg,2020

Cited by 2 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Fine-Grained Prosody Transfer Text-to-Speech Synthesis with Transformer;2024 5th International Conference in Electronic Engineering, Information Technology & Education (EEITE);2024-05-29

2. Deep learning-based expressive speech synthesis: a systematic review of approaches, challenges, and resources;EURASIP Journal on Audio, Speech, and Music Processing;2024-02-12