Deep Belief Neural Networks and Bidirectional Long-Short Term Memory Hybrid for Speech Recognition-Reference-Cited by-同舟云学术

Deep Belief Neural Networks and Bidirectional Long-Short Term Memory Hybrid for Speech Recognition

Published:2015-06-01 Issue:2 Volume:40 Page:191-195
ISSN:2300-262X
Container-title:Archives of Acoustics
language:
Short-container-title:

Author:

Brocki Łukasz,Marasek Krzysztof

Abstract

Abstract This paper describes a Deep Belief Neural Network (DBNN) and Bidirectional Long-Short Term Memory (LSTM) hybrid used as an acoustic model for Speech Recognition. It was demonstrated by many independent researchers that DBNNs exhibit superior performance to other known machine learning frameworks in terms of speech recognition accuracy. Their superiority comes from the fact that these are deep learning networks. However, a trained DBNN is simply a feed-forward network with no internal memory, unlike Recurrent Neural Networks (RNNs) which are Turing complete and do posses internal memory, thus allowing them to make use of longer context. In this paper, an experiment is performed to make a hybrid of a DBNN with an advanced bidirectional RNN used to process its output. Results show that the use of the new DBNN-BLSTM hybrid as the acoustic model for the Large Vocabulary Continuous Speech Recognition (LVCSR) increases word recognition accuracy. However, the new model has many parameters and in some cases it may suffer performance issues in real-time applications.

Publisher

Walter de Gruyter GmbH

Subject

Acoustics and Ultrasonics

Link

https://www.degruyter.com/view/j/aoa.2015.40.issue-2/aoa-2015-0021/aoa-2015-0021.pdf

Reference6 articles.

1. Long Short - Term Memory;HochreiterS;Neural Computation,1995

2. A fast learning algorithm for deep belief nets;HintonG;Neural Computation,2006

3. A Learning Algorithm for Boltzmann Machines;AckleyD;Cognitive Science,1985

4. Backpropagation through time : what it does and how to do it;WerbosP;IEEE,1987

5. A tutorial on hidden markov models and selected applications in speech recognition pp;RabinerL;IEEE,1989

Cited by 34 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Optimized differential evolution and hybrid deep learning for superior drug-target binding affinity prediction;Alexandria Engineering Journal;2024-11

2. CNN models for Maghrebian accent recognition with SVM silence elimination;Signal, Image and Video Processing;2024-05-01

3. A Novel Deep Learning Language Model with Hybrid-GFX Embedding and Hyperband Search for Opinion Analysis;SN Computer Science;2023-09-29

4. Bio-inspired algorithm-based hyperparameter tuning for drug-target binding affinity prediction in healthcare;Intelligent Decision Technologies;2023-07-12

5. A RTL Implementation of Heterogeneous Machine Learning Network for French Computer Assisted Pronunciation Training;Applied Sciences;2023-05-09