Affiliation:
1. Dharmsinh Desai University, Nadiad, Gujarat, India
Abstract
We present a novel approach for improving the performance of an End-to-End speech recognition system for the Gujarati language. We follow a deep learning-based approach that includes Convolutional Neural Network, Bi-directional Long Short Term Memory layers, Dense layers, and Connectionist Temporal Classification as a loss function. To improve the performance of the system with the limited size of the dataset, we present a combined language model (Word-level language Model and Character-level language model)-based prefix decoding technique and Bidirectional Encoder Representations from Transformers-based post-processing technique. To gain key insights from our Automatic Speech Recognition (ASR) system, we used the inferences from the system and proposed different analysis methods. These insights help us in understanding and improving the ASR system as well as provide intuition into the language used for the ASR system. We have trained the model on the Microsoft Speech Corpus, and we observe a 5.87% decrease in Word Error Rate (WER) with respect to base-model WER.
Publisher
Association for Computing Machinery (ACM)
Reference57 articles.
1. Applying Convolutional Neural Networks concepts to hybrid NN-HMM model for speech recognition
2. Dario Amodei Rishita Anubhai Eric Battenberg Carl Case Jared Casper Bryan Catanzaro Jingdong Chen Mike Chrzanowski Adam Coates Greg Diamos Erich Elsen Jesse H. Engel Linxi Fan Christopher Fougner Tony Han Awni Y. Hannun Billy Jun Patrick LeGresley Libby Lin Sharan Narang Andrew Y. Ng Sherjil Ozair Ryan Prenger Jonathan Raiman Sanjeev Satheesh David Seetapun Shubho Sengupta Yi Wang Zhiqian Wang and Chong Wang. 2015. Deep speech 2: End-to-end speech recognition in English and Mandarin. Retrieved from http://arxiv.org/abs/1512.02595.
3. Speech Analysis and Synthesis by Linear Prediction of the Speech Wave
4. The DRAGON system--An overview
5. Automatic speech recognition for under-resourced languages: A survey
Cited by
8 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献