A Study on a Speech Emotion Recognition System with Effective Acoustic Features Using Deep Learning Algorithms

Published: 2021-02-21 Issue: 4 Volume: 11 Page: 1890
ISSN: 2076-3417
Container-title: Applied Sciences
language: en

Author:

Byun Sung-Woo, Lee Seok-Pil

Abstract

The goal of a human interface is to recognize the user’s emotional state precisely. In speech emotion recognition research, the most important issue is combining the extraction of proper speech features with an appropriate classification engine. Well-defined speech databases are also needed to accurately recognize and analyze emotions from speech signals. In this work, we constructed a Korean emotional speech database for speech emotion analysis and proposed a feature combination that can improve emotion recognition performance using a recurrent neural network model. To investigate acoustic features that can reflect distinct momentary changes in emotional expression, we extracted F0, Mel-frequency cepstral coefficients, spectral features, harmonic features, and others. Statistical analysis was performed to select an optimal combination of acoustic features that affect the emotion conveyed in speech. We used a recurrent neural network model to classify emotions from speech. The results show that the proposed system achieves more accurate performance than previous studies.
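
The abstract names F0 (fundamental frequency) as one of the extracted acoustic features but does not say how it was computed. As a rough illustration only, the sketch below estimates F0 for a single frame by picking the peak of the autocorrelation function. The function name `estimate_f0`, the 75 to 400 Hz search range, and the 16 kHz sample rate are assumptions made for this example and are not taken from the paper.

```python
import math

def estimate_f0(frame, sample_rate, f0_min=75.0, f0_max=400.0):
    """Estimate F0 (Hz) of one frame by autocorrelation peak picking.

    Hypothetical helper for illustration; not the authors' method.
    Returns 0.0 when no positive autocorrelation peak is found.
    """
    n = len(frame)
    mean = sum(frame) / n
    x = [s - mean for s in frame]                     # remove DC offset
    lag_min = int(sample_rate / f0_max)               # shortest period searched
    lag_max = min(int(sample_rate / f0_min), n - 1)   # longest period searched
    best_lag, best_score = 0, 0.0
    for lag in range(lag_min, lag_max + 1):
        # Unnormalized autocorrelation at this lag
        score = sum(x[i] * x[i + lag] for i in range(n - lag))
        if score > best_score:
            best_lag, best_score = lag, score
    return sample_rate / best_lag if best_lag else 0.0

# Usage: a synthetic 200 Hz tone, 50 ms long, sampled at 16 kHz (assumed values)
sample_rate = 16000
tone = [math.sin(2 * math.pi * 200 * i / sample_rate) for i in range(800)]
print(estimate_f0(tone, sample_rate))
```

On the synthetic tone the estimate should land close to 200 Hz. Real speech would need voicing detection and a more robust pitch tracker, and a per-frame F0 track of this kind would be only one input among the feature set the abstract describes.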

Funder

National Research Foundation of Korea

Publisher

MDPI AG

Subject

Fluid Flow and Transfer Processes, Computer Science Applications, Process Chemistry and Technology, General Engineering, Instrumentation, General Materials Science

Link

https://www.mdpi.com/2076-3417/11/4/1890/pdf

References: 59 articles.

1. Survey on speech emotion recognition: Features, classification schemes, and databases

2. A Comparison of Effective Feature Vectors for Speech Emotion Recognition;Shin;Trans. Korean Inst. Electr. Eng.,2018

3. Speech Emotion Recognition Using Deep Convolutional Neural Network and Discriminant Temporal Pyramid Matching

4. A machine learning model for emotion recognition from physiological signals

Cited by 22 articles.

1. Multi-Label Emotion Recognition of Korean Speech Data Using Deep Fusion Models;Applied Sciences;2024-08-28

2. CNN-Based Models for Emotion and Sentiment Analysis Using Speech Data;ACM Transactions on Asian and Low-Resource Language Information Processing;2024-08-08

3. Automatic Age and Gender Recognition Using Ensemble Learning;Applied Sciences;2024-08-06

4. Teager Energy-Autocorrelation Envelope for Stressed Speech Emotion Recognition with Spectral Features: A Multi-database Analysis;Wireless Personal Communications;2024-07-20

5. Non-speech emotion recognition based on back propagation feed forward networks;Journal of Intelligent & Fuzzy Systems;2024-04-18