A Study of Cross-Linguistic Speech Emotion Recognition Based on 2D Feature Spaces-Reference-Cited by-同舟云学术

A Study of Cross-Linguistic Speech Emotion Recognition Based on 2D Feature Spaces

Published:2020-10-20 Issue:10 Volume:9 Page:1725
ISSN:2079-9292
Container-title:Electronics
language:en
Short-container-title:Electronics

Author:

Tamulevičius Gintautas,Korvel Gražina^ORCID,Yayak Anil Bora^ORCID,Treigys Povilas,Bernatavičienė Jolita^ORCID,Kostek Bożena^ORCID

Abstract

In this research, a study of cross-linguistic speech emotion recognition is performed. For this purpose, emotional data of different languages (English, Lithuanian, German, Spanish, Serbian, and Polish) are collected, resulting in a cross-linguistic speech emotion dataset with the size of more than 10.000 emotional utterances. Despite the bi-modal character of the databases gathered, our focus is on the acoustic representation only. The assumption is that the speech audio signal carries sufficient emotional information to detect and retrieve it. Several two-dimensional acoustic feature spaces, such as cochleagrams, spectrograms, mel-cepstrograms, and fractal dimension-based space, are employed as the representations of speech emotional features. A convolutional neural network (CNN) is used as a classifier. The results show the superiority of cochleagrams over other feature spaces utilized. In the CNN-based speaker-independent cross-linguistic speech emotion recognition (SER) experiment, the accuracy of over 90% is achieved, which is close to the monolingual case of SER.

Publisher

MDPI AG

Subject

Electrical and Electronic Engineering,Computer Networks and Communications,Hardware and Architecture,Signal Processing,Control and Systems Engineering

Link

https://www.mdpi.com/2079-9292/9/10/1725/pdf

Reference58 articles.

1. Affective Computing and Sentiment Analysis

2. Multilingual sentiment analysis: from formal to informal and scarce resource languages

3. Emotion recognition from multichannel EEG signals using K-nearest neighbor classification

4. Stress emotion recognition based on RSP and EMG signals;Wei,2013

5. Attention-LSTM-Attention Model for Speech Emotion Recognition and Analysis of IEMOCAP Database

Cited by 27 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Automatic Accent Identification Using Less Data: a Shift from Global to Segmental Accent;Arabian Journal for Science and Engineering;2024-08-13

2. Speech Emotion Recognition Using a Multi-Time-Scale Approach to Feature Aggregation and an Ensemble of SVM Classifiers;Archives of Acoustics;2024-05-28

3. Semi-supervised cross-lingual speech emotion recognition;Expert Systems with Applications;2024-03

4. Cochleagram to Recognize Dysphonia: Auditory Perceptual Analysis for Health Informatics;IEEE Access;2024

5. Human-Robot Interaction Communication Control System Using Lithuanian Language;Baltic Journal of Modern Computing;2024