EmoMatchSpanishDB: study of speech emotion recognition machine learning models in a new Spanish elicited database

Published:2023-07-04 Issue:5 Volume:83 Page:13093-13112
ISSN:1573-7721
Container-title:Multimedia Tools and Applications
language:en
Short-container-title:Multimed Tools Appl

Author:

Garcia-Cuesta Esteban, Salvador Antonio Barba, Páez Diego Gachet

Abstract

In this paper we present a new speech emotion dataset in Spanish. The database was created using an elicited approach and comprises recordings of fifty non-actors expressing Ekman's six basic emotions (anger, disgust, fear, happiness, sadness, and surprise) plus a neutral tone. This article describes how the database was created, from the recording step to the crowdsourced perception test. The crowdsourcing made it possible to statistically validate the emotion of each collected audio sample and to filter out noisy samples. As a result we obtained two datasets, EmoSpanishDB and EmoMatchSpanishDB. The first includes the recorded audios that reached consensus during the crowdsourcing process. The second selects from EmoSpanishDB only those audios whose perceived emotion also matches the originally elicited one. Finally, we present a baseline comparative study of different state-of-the-art machine learning techniques in terms of accuracy, precision, and recall for both datasets. The results obtained for EmoMatchSpanishDB improve on those obtained for EmoSpanishDB, and we therefore recommend following this methodology for the creation of emotional databases.
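The two-stage selection described in the abstract can be illustrated with a minimal sketch. The abstract does not state the consensus criterion, so the simple majority threshold used here (`min_agreement`), the record layout, and the function names `consensus_label` and `build_datasets` are all assumptions made for illustration, not the authors' procedure.

```python
from collections import Counter


def consensus_label(votes, min_agreement=0.5):
    """Return the most-voted emotion if its vote share exceeds min_agreement.

    The threshold is a hypothetical stand-in for the paper's statistical
    validation, which the abstract does not specify.
    """
    if not votes:
        return None
    label, count = Counter(votes).most_common(1)[0]
    return label if count / len(votes) > min_agreement else None


def build_datasets(samples, min_agreement=0.5):
    """Split samples into a consensus set and a consensus-plus-match set."""
    emo_spanish, emo_match = [], []
    for sample in samples:
        perceived = consensus_label(sample["votes"], min_agreement)
        if perceived is None:
            continue  # no consensus among raters: treated as a noisy sample
        record = {
            "id": sample["id"],
            "elicited": sample["elicited"],
            "perceived": perceived,
        }
        emo_spanish.append(record)  # analogue of EmoSpanishDB
        if perceived == sample["elicited"]:
            emo_match.append(record)  # analogue of EmoMatchSpanishDB
    return emo_spanish, emo_match


# Made-up toy data, for illustration only.
samples = [
    {"id": "a1", "elicited": "anger", "votes": ["anger", "anger", "disgust"]},
    {"id": "a2", "elicited": "fear", "votes": ["surprise", "surprise", "surprise"]},
    {"id": "a3", "elicited": "sadness", "votes": ["sadness", "neutral", "fear", "anger"]},
]
emo_spanish, emo_match = build_datasets(samples)
print([r["id"] for r in emo_spanish])
print([r["id"] for r in emo_match])
```

In this toy run, a sample with no majority label is dropped from both sets, while a sample whose raters agree on an emotion other than the elicited one stays in the first set only. By construction the second set is always a subset of the first, as in the abstract.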

Funder

Universidad Europea de Madrid

Universidad Politécnica de Madrid

Publisher

Springer Science and Business Media LLC

Subject

Computer Networks and Communications, Hardware and Architecture, Media Technology, Software

Link

https://link.springer.com/content/pdf/10.1007/s11042-023-15959-w.pdf

References: 48 articles.

1. Amer MR, Siddiquie B, Richey C, Divakaran A (2014) Emotion recognition in speech using deep networks. In: ICASSP. Florence, Italy, pp 3752–3756

2. Attwood AS, Easey KE, Dalili MN, Skinner AL, Woods A, Crick L, Ilett E, Penton-Voak IS, Munafò MR (2017) State anxiety and emotional face recognition in healthy volunteers. R Soc Open Sci 4(5):160855

3. Burkhardt F, Paeschke A, Rolfes M, Sendlmeier W, Weiss B (2005) A database of German emotional speech. In: Proc. Interspeech, pp 1517–1520

4. Byun S, Lee S (2016) Emotion Recognition Using Tone and Tempo Based on Voice for IoT. Trans Korean Inst Electr Eng 65:116–121

5. Calvo RA, D’Mello S (2012) Frontiers of Affect-Aware Learning Technologies. Intell. Syst. IEEE. 27(27):86–89

Cited by 1 article.

1. A Combined CNN Architecture for Speech Emotion Recognition;Sensors;2024-09-06