Acoustic Modeling of Speech Signal using Artificial Neural Network-Reference-Cited by-同舟云学术

Acoustic Modeling of Speech Signal using Artificial Neural Network

Published:2015 Issue: Volume: Page:282-299
ISSN:2327-3453
Container-title:Advances in Systems Analysis, Software Engineering, and High Performance Computing
language:
Short-container-title:

Author:

Sarma Mousmita¹,Sarma Kandarpa Kumar¹^ORCID

Affiliation:

1. Gauhati University, India

Abstract

Acoustic modeling of the sound unit is a crucial component of Automatic Speech Recognition (ASR) system. This is the process of establishing statistical representations for the feature vector sequences for a particular sound unit so that a classifier for the entire sound unit used in the ASR system can be designed. Current ASR systems use Hidden Markov Model (HMM) to deal with temporal variability and Gaussian Mixture Model (GMM) for acoustic modeling. Recently machine learning paradigms have been explored for application in speech recognition domain. In this regard, Multi Layer Perception (MLP), Recurrent Neural Network (RNN) etc. are extensively used. Artificial Neural Network (ANN)s are trained by back propagating the error derivatives and therefore have the potential to learn much better models of nonlinear data. Recently, Deep Neural Network (DNN)s with many hidden layer have been up voted by the researchers and have been accepted to be suitable for speech signal modeling. In this chapter various techniques and works on the ANN based acoustic modeling are described.

Publisher

IGI Global

Reference54 articles.

1. Sub-band-based speech recognition.;H.Bourlard;Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing,1997

2. Connectionist Speech Recognition

3. Brown, P. A. (1987). The Acoustic Modeling Problem in Automatic Speech Recognition [Doctoral dissertation]. School of Computer Science at Carnegie Mellon University.

4. Dahl G. E., Yu D, Deng Li & Acero A. (2012). Context-Dependent Pre-Trained Deep Neural Networks for Large-Vocabulary Speech Recognition. IEEE Transactions on Audio, Speech, and Language Processing. 20(1), 30-42.

5. Maximum likelihood from incomplete data via the EM algorithm.;A. P.Dempster;Journal of the Royal Statistical Society. Series B. Methodological,1977

Cited by 7 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Design and Evaluation of Speech Processing Systems for Meetei/Meitei Mayek;Lecture Notes in Electrical Engineering;2023-10-03

2. Machine and Deep‐Learning Techniques for Text and Speech Processing;Machine Learning Algorithms for Signal and Image Processing;2022-11-18

3. Bangla Natural Language Processing: A Comprehensive Analysis of Classical, Machine Learning, and Deep Learning-Based Methods;IEEE Access;2022

4. Method of constructing and identifying predictive models of human behavior based on information models of non-verbal signals;Procedia Computer Science;2022

5. 3DSRASG: 3D Scene Retrieval and Augmentation Using Semantic Graphs;Progress in Artificial Intelligence;2021