Discriminative Training Using Noise Robust Integrated Features and Refined HMM Modeling-Reference-Cited by-同舟云学术

Discriminative Training Using Noise Robust Integrated Features and Refined HMM Modeling

Published:2018-02-20 Issue:1 Volume:29 Page:327-344
ISSN:2191-026X
Container-title:Journal of Intelligent Systems
language:
Short-container-title:

Author:

Dua Mohit¹,Aggarwal Rajesh Kumar¹,Biswas Mantosh¹

Affiliation:

1. Department of Computer Engineering, National Institute of Technology, Kurukshetra, India

Abstract

Abstract The classical approach to build an automatic speech recognition (ASR) system uses different feature extraction methods at the front end and various parameter classification techniques at the back end. The Mel-frequency cepstral coefficients (MFCC) and perceptual linear prediction (PLP) techniques are the conventional approaches used for many years for feature extraction, and the hidden Markov model (HMM) has been the most obvious selection for feature classification. However, the performance of MFCC-HMM and PLP-HMM-based ASR system degrades in real-time environments. The proposed work discusses the implementation of discriminatively trained Hindi ASR system using noise robust integrated features and refined HMM model. It sequentially combines MFCC with PLP and MFCC with gammatone-frequency cepstral coefficient (GFCC) to obtain MF-PLP and MF-GFCC integrated feature vectors, respectively. The HMM parameters are refined using genetic algorithm (GA) and particle swarm optimization (PSO). Discriminative training of acoustic model using maximum mutual information (MMI) and minimum phone error (MPE) is preformed to enhance the accuracy of the proposed system. The results show that discriminative training using MPE with MF-GFCC integrated feature vector and PSO-HMM parameter refinement gives significantly better results than the other implemented techniques.

Publisher

Walter de Gruyter GmbH

Subject

Artificial Intelligence,Information Systems,Software

Link

https://www.degruyter.com/document/doi/10.1515/jisys-2017-0618/pdf

Reference92 articles.

1. Large scale discriminative training of hidden Markov models for speech recognition;Comput. Speech Lang.,2002

2. Hybrid wavelet based LPC features for Hindi speech recognition;Int. J. Inf. Commun. Technol.,2008

3. A heterogeneous speech feature vectors generation approach with hybrid hmm classifiers;Int. J. Speech Technol.,2017

4. Large-vocabulary continuous speech recognition systems: a look at some recent advances;IEEE Signal Process. Mag.,2012

5. New front end based on multitaper and gammatone filters for robust speaker verification,2017

Cited by 24 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Feature extraction using GTCC spectrogram and ResNet50 based classification for audio spoof detection;International Journal of Speech Technology;2024-03

2. A review on Gujarati language based automatic speech recognition (ASR) systems;International Journal of Speech Technology;2024-03

3. An amalgamation of integrated features with DeepSpeech2 architecture and improved spell corrector for improving Gujarati language ASR system;International Journal of Speech Technology;2024-02-13

4. Enhancing Performance of Noise-Robust Gujarati Language ASR Utilizing the Hybrid Acoustic Model and Combined MFCC + GTCC Feature;Lecture Notes in Networks and Systems;2024

5. Gaussian-Filtered High-Frequency-Feature Trained Optimized BiLSTM Network for Spoofed-Speech Classification;Sensors;2023-07-24