Authors:
Hussain Mansoor, Abishek S, Ashwanth K P, Bharanidharan C, Girish S
Abstract
Speech cues can be used to identify human emotions with deep learning models for speech emotion recognition, trained with supervised or unsupervised machine learning on speech emotion databases and then applied to predict emotions in test data. Despite many advantages, such systems still suffer from limited accuracy and other shortcomings. To mitigate these issues, we propose a new feature-specific hybrid framework for speech emotion recognition that combines deep learning architectures, namely a recurrent neural network and a convolutional neural network, and analyses different characteristics to better describe speech emotion. The framework first extracts features by applying a bag-of-audio-words model to Mel-frequency cepstral coefficient (MFCC) characteristics, producing a set of acoustic words composed of emotion features that is fed into the hybrid deep learning architecture to achieve high classification and prediction accuracy. The outputs of the two networks in the proposed hybrid are then concatenated and passed to a softmax layer, which produces a categorical classification for speech emotion recognition. The model is evaluated on the Ryerson Audio-Visual Database of Emotional Speech and Song (RAVDESS) audio dataset, which comprises eight emotion classes. Experimental results on this dataset show that the proposed framework outperforms state-of-the-art approaches, achieving an 89.5% recognition rate and 98% accuracy.
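The sketch below illustrates the kind of pipeline the abstract describes: MFCC features drawn from RAVDESS clips feeding a hybrid CNN and RNN whose branch outputs are concatenated before an 8-way softmax. It is a minimal illustration, not the authors' implementation: the use of librosa and Keras, all layer sizes and hyperparameters, and the fixed-length padding are assumptions, and the bag-of-audio-words quantization step is omitted in favour of raw MFCC sequences for brevity.

```python
# Hedged sketch of a hybrid CNN + RNN speech emotion classifier.
# Library choices, layer sizes, and hyperparameters are illustrative assumptions;
# the paper specifies only MFCC / bag-of-audio-words features, a CNN-RNN hybrid
# with concatenated outputs, and an 8-class softmax over RAVDESS emotions.
import numpy as np
import librosa
from tensorflow.keras import layers, Model

N_MFCC = 40        # assumed number of MFCC coefficients per frame
MAX_FRAMES = 200   # assumed fixed number of time frames per utterance
N_CLASSES = 8      # RAVDESS: neutral, calm, happy, sad, angry, fearful, disgust, surprised

def mfcc_features(path, sr=22050):
    """Load one audio clip and return a fixed-size (MAX_FRAMES, N_MFCC) MFCC matrix."""
    y, sr = librosa.load(path, sr=sr)
    mfcc = librosa.feature.mfcc(y=y, sr=sr, n_mfcc=N_MFCC).T  # (frames, N_MFCC)
    # Pad or truncate along the time axis so every clip has the same shape.
    if mfcc.shape[0] < MAX_FRAMES:
        mfcc = np.pad(mfcc, ((0, MAX_FRAMES - mfcc.shape[0]), (0, 0)))
    return mfcc[:MAX_FRAMES]

def build_hybrid_model():
    """CNN and RNN branches over the same MFCC input, concatenated before softmax."""
    inp = layers.Input(shape=(MAX_FRAMES, N_MFCC))

    # CNN branch: 1-D convolutions over time capture local spectral patterns.
    c = layers.Conv1D(64, 5, activation="relu", padding="same")(inp)
    c = layers.MaxPooling1D(2)(c)
    c = layers.Conv1D(128, 5, activation="relu", padding="same")(c)
    c = layers.GlobalAveragePooling1D()(c)

    # RNN branch: a bidirectional LSTM models longer-range temporal dynamics.
    r = layers.Bidirectional(layers.LSTM(64))(inp)

    # Concatenate both branch outputs and classify with an 8-way softmax.
    merged = layers.Concatenate()([c, r])
    out = layers.Dense(N_CLASSES, activation="softmax")(merged)

    model = Model(inp, out)
    model.compile(optimizer="adam",
                  loss="sparse_categorical_crossentropy",
                  metrics=["accuracy"])
    return model

if __name__ == "__main__":
    build_hybrid_model().summary()
```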
Subject
General Physics and Astronomy
Cited by
2 articles.
1. Application of Deep Learning in Chinese Speech Recognition System; Lecture Notes on Data Engineering and Communications Technologies; 2024
2. Deep Learning Algorithm Composition System Based on Music Score Recognition; 2022 International Conference on Knowledge Engineering and Communication Systems (ICKES); 2022-12-28