A Perspective Study on Speech Emotion Recognition: Databases, Features and Classification Models

Published:2021-12-31 Issue:6 Volume:38 Page:1861-1873
ISSN:0765-0019
Container-title:Traitement du Signal
Short-container-title:TS

Author:

Raghu Kogila, Sadanandam Manchala

Abstract

Automatic Speech Recognition (ASR) is a popular research area that must cope with wide variation in human behaviour and interaction. Human beings rely on speech for communication and conversation. During a conversation, the message carried by the speech utterances is transferred, together with information about the speaker's traits, such as emotion and physiological characteristics, and about the environment. These signals are numerous, complex and encoded, yet human intelligence decodes them quickly. Many researchers in the domain of Human Computer Interaction (HCI) are working to automate speech generation and the extraction of speech attributes and meaning. For example, ASR can govern the use of voice commands and dictation while also recognizing and verifying the speaker's speech. Accent and nativity traits also make it possible to discern the speaker's emotional state from the speech. In this paper, we discuss the human speech production system, research problems in speech processing, and the motivation, challenges and objectives of Speech Emotion Recognition (SER), and we explain in detail the work done so far on Telugu speech emotion databases and their role. We also describe our own database, the Database for Emotions in Telugu Language (DETL), and the Audacity software used to create it.

Publisher

International Information and Engineering Technology Association

Subject

Electrical and Electronic Engineering