A Situational Analysis of Current Speech-Synthesis Systems for Child Voices: A Scoping Review of Qualitative and Quantitative Evidence-Reference-Cited by-同舟云学术

A Situational Analysis of Current Speech-Synthesis Systems for Child Voices: A Scoping Review of Qualitative and Quantitative Evidence

Published:2022-06-01 Issue:11 Volume:12 Page:5623
ISSN:2076-3417
Container-title:Applied Sciences
language:en
Short-container-title:Applied Sciences

Author:

Terblanche Camryn^ORCID,Harty Michal,Pascoe Michelle^ORCID,Tucker Benjamin V.^ORCID

Abstract

(1) Background: Speech synthesis has customarily focused on adult speech, but with the rapid development of speech-synthesis technology, it is now possible to create child voices with a limited amount of child-speech data. This scoping review summarises the evidence base related to developing synthesised speech for children. (2) Method: The included studies were those that were (1) published between 2006 and 2021 and (2) included child participants or voices of children aged between 2–16 years old. (3) Results: 58 studies were identified. They were discussed based on the languages used, the speech-synthesis systems and/or methods used, the speech data used, the intelligibility of the speech and the ages of the voices. Based on the reviewed studies, relative to adult-speech synthesis, developing child-speech synthesis is notably more challenging. Child speech often presents with acoustic variability and articulatory errors. To account for this, researchers have most often attempted to adapt adult-speech models, using a variety of different adaptation techniques. (4) Conclusions: Adapting adult speech has proven successful in child-speech synthesis. It appears that the resulting quality can be improved by training a large amount of pre-selected speech data, aided by a neural-network classifier, to better match the children’s speech. We encourage future research surrounding individualised synthetic speech for children with CCN, with special attention to children who make use of low-resource languages.

Publisher

MDPI AG

Subject

Fluid Flow and Transfer Processes,Computer Science Applications,Process Chemistry and Technology,General Engineering,Instrumentation,General Materials Science

Link

https://www.mdpi.com/2076-3417/12/11/5623/pdf

Reference85 articles.

1. Effects of AAC interventions on communication and language for young children with complex communication needs

2. Building personalised synthetic voices for individuals with severe speech impairment

3. Speech synthesis technologies for individuals with vocal disabilities: Voice banking and reconstruction

4. Towards Personalized Speech Synthesis for Augmentative and Alternative Communication

Cited by 3 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. The development of synthetic child speech in three South African languages;Augmentative and Alternative Communication;2024-07-11

2. Automated Child Voice Generation: Methodology and Implementation;2023 International Conference on Speech Technology and Human-Computer Dialogue (SpeD);2023-10-25

3. Special Issue on Applications of Speech and Language Technologies in Healthcare;Applied Sciences;2023-06-05