Automatic Speech Recognition (ASR) Systems for Children: A Systematic Literature Review-Reference-Cited by-同舟云学术

Automatic Speech Recognition (ASR) Systems for Children: A Systematic Literature Review

Published:2022-04-27 Issue:9 Volume:12 Page:4419
ISSN:2076-3417
Container-title:Applied Sciences
language:en
Short-container-title:Applied Sciences

Author:

Bhardwaj Vivek^ORCID,Ben Othman Mohamed Tahar^ORCID,Kukreja Vinay^ORCID,Belkhier Youcef^ORCID,Bajaj Mohit^ORCID,Goud B. Srikanth^ORCID,Rehman Ateeq Ur^ORCID,Shafiq Muhammad^ORCID,Hamam Habib^ORCID

Abstract

Automatic speech recognition (ASR) is one of the ways used to transform acoustic speech signals into text. Over the last few decades, an enormous amount of research work has been done in the research area of speech recognition (SR). However, most studies have focused on building ASR systems based on adult speech. The recognition of children’s speech was neglected for some time, which means that the field of children’s SR research is wide open. Children’s SR is a challenging task due to the large variations in children’s articulatory, acoustic, physical, and linguistic characteristics compared to adult speech. Thus, the field became a very attractive area of research and it is important to understand where the main center of attention is, and what are the most widely used methods for extracting acoustic features, various acoustic models, speech datasets, the SR toolkits used during the recognition process, and so on. ASR systems or interfaces are extensively used and integrated into various real-life applications, such as search engines, the healthcare industry, biometric analysis, car systems, the military, aids for people with disabilities, and mobile devices. A systematic literature review (SLR) is presented in this work by extracting the relevant information from 76 research papers published from 2009 to 2020 in the field of ASR for children. The objective of this review is to throw light on the trends of research in children’s speech recognition and analyze the potential of trending techniques to recognize children’s speech.

Funder

Qassim University

Publisher

MDPI AG

Subject

Fluid Flow and Transfer Processes,Computer Science Applications,Process Chemistry and Technology,General Engineering,Instrumentation,General Materials Science

Link

https://www.mdpi.com/2076-3417/12/9/4419/pdf

Reference97 articles.

1. A systematic literature review of software effort prediction using machine learning methods

2. A survey on automatic speech recognition systems for Portuguese language and its variations

3. A Survey about Databases of Children’s Speech a Survey about Databases of Children’s Speech Dresden University of Technology, Chair for System Theory and Speech Technologyhttps://www.isca-speech.org/archive_v0/archive_papers/interspeech_2013/i13_2410.pdf

4. HTK Speech Recognition Toolkithttp://htk.eng.cam.ac.uk/

5. Overview of the CMUSphinx Toolkithttps://cmusphinx.github.io/wiki/tutorialoverview/

Cited by 19 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Speech and speaker recognition using raw waveform modeling for adult and children’s speech: A comprehensive review;Engineering Applications of Artificial Intelligence;2024-05

2. A Study on Expression Recognition Based on Improved MobileNetV2 Network;2024-01-29

3. Improving Text-Independent Forced Alignment to Support Speech-Language Pathologists with Phonetic Transcription;Sensors;2023-12-06

4. Comparison of Data Augmentation Techniques on Filipino ASR for Children’s Speech;2023 International Conference on Speech Technology and Human-Computer Dialogue (SpeD);2023-10-25

5. End-of-Sentence Token Modeling for Streaming Conformer-Based Korean Children’s Speech Recognition Applied to a Social Robot;2023 IEEE International Conference on Consumer Electronics-Asia (ICCE-Asia);2023-10-23