A Transfer Learning End-to-End Arabic Text-To-Speech (TTS) Deep Architecture-Reference-Cited by-同舟云学术

A Transfer Learning End-to-End Arabic Text-To-Speech (TTS) Deep Architecture

Published:2020 Issue: Volume: Page:266-277
ISSN:0302-9743
Container-title:Artificial Neural Networks in Pattern Recognition
language:
Short-container-title:

Author:

Fahmy Fady K.^ORCID,Khalil Mahmoud I.^ORCID,Abbas Hazem M.^ORCID

Publisher

Springer International Publishing

Link

https://link.springer.com/content/pdf/10.1007/978-3-030-58309-5_22

Reference26 articles.

1. Hunt, A.J., Black, A.W.: Unit selection in a concatenative speech synthesis system using a large speech database. In: 1996 IEEE International Conference on Acoustics, Speech, and Signal Processing Conference Proceedings, pp. 373–376 (1996)

2. Hamon, C., Mouline, E., Charpentier, F.: A diphone synthesis system based on time-domain prosodic modifications of speech. In: International Conference on Acoustics, Speech, and Signal Processing, pp. 238–241 (1989)

3. Tokuda, K., Nankaku, Y., Toda, T., Zen, H., Yamagishi, J., Oura, K.: Speech synthesis based on hidden Markov models. In: Proceedings of the IEEE, pp. 1234–1252 (2013)

4. Yu, K., Young, S.: Continuous F0 modeling for HMM based statistical parametric speech synthesis. In: IEEE Transactions on Audio, Speech, and Language Processing, pp. 1071–1079 (2011)

5. van den Oord, A., et al.: WaveNet: a generative model for raw audio. CoRR arXiv:1609.03499 (2016)

Cited by 11 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Planning the development of text-to-speech synthesis models and datasets with dynamic deep learning;Journal of King Saud University - Computer and Information Sciences;2024-09

2. Central Kurdish Text-to-Speech Synthesis with Novel End-to-End Transformer Training;Algorithms;2024-07-03

3. End-to-End Text-to-Speech Systems in Arabic: A Comparative Study;2024 IEEE 12th International Symposium on Signal, Image, Video and Communications (ISIVC);2024-05-21

4. Advancing Limited Data Text-to-Speech Synthesis: Non-Autoregressive Transformer for High-Quality Parallel Synthesis;2023 International Conference on Speech Technology and Human-Computer Dialogue (SpeD);2023-10-25

5. Deep transfer learning for automatic speech recognition: Towards better generalization;Knowledge-Based Systems;2023-10