Authors:
Kumar Sandeep, Yadav Jainath
Abstract
Long Short-Term Memory (LSTM) captures long-term dependencies more accurately than other types of neural networks, and it is frequently used in deep learning. In this work, we explore a Deep LSTM with a dropout layer, which mitigates overfitting during training. We use the IITKGP-SEHSC emotional speech dataset for emotion recognition, considering five emotions, namely angry, fear, happy, neutral, and sad, recorded from male and female speech. Since the IITKGP-SEHSC dataset is monolingual, spectral features alone are sufficient for emotion recognition. Traditional MFCCs emphasize low-frequency information. Here, we explore two features: the Gammatone Mel Frequency Cepstral Coefficient (GMFCC) and the Discrete Wavelet Mel Frequency Cepstral Coefficient (DMFCC). GMFCC models basilar membrane displacement via a gammatone filter bank and is useful for recognizing gender from emotional speech. DMFCC applies MFCC analysis to the high-frequency components of speech rather than the low-frequency components, and in the proposed work it is used for recognizing emotions from speech. The average accuracy of gender classification with Deep LSTM and GMFCC is 98.3%. The average emotion recognition rate with Deep LSTM and DMFCC is 92% for male speech and 88.7% for female speech. Our proposed model combines the above sub-models and achieves an emotion recognition accuracy of 91.2% for male speech and 87.6% for female speech.
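The DMFCC feature described above rests on a discrete wavelet transform that splits speech into a low-frequency approximation band and a high-frequency detail band, with MFCC-style analysis then applied to the detail band. A minimal sketch of that band split is shown below; it uses a single-level Haar filter purely for illustration (the paper's actual DWT filter choice, decomposition depth, and full MFCC pipeline are not specified here and would differ in practice):

```python
import numpy as np

def haar_dwt(x):
    """Single-level Haar DWT: returns (approximation, detail) coefficients.

    The approximation band carries the low-frequency content that
    traditional MFCCs emphasize; the detail band carries the
    high-frequency content that a DMFCC-style analysis would use.
    """
    x = np.asarray(x, dtype=float)
    if len(x) % 2:          # Haar pairs samples, so drop a trailing odd sample
        x = x[:-1]
    a = (x[0::2] + x[1::2]) / np.sqrt(2)  # low-pass (approximation)
    d = (x[0::2] - x[1::2]) / np.sqrt(2)  # high-pass (detail)
    return a, d

# Toy signal: a slow ramp (low-frequency) plus a fast +1/-1 alternation
# (high-frequency). The detail band isolates the fast alternation.
n = 16
x = np.arange(n, dtype=float) + np.array([1.0, -1.0] * (n // 2))
a, d = haar_dwt(x)
```

Because this Haar transform is orthonormal, the signal energy splits exactly between the two bands, which is why each band can be analyzed independently without losing information.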
Subject
General Physics and Astronomy
Cited by
3 articles.