Recognition of Cross-Language Acoustic Emotional Valence Using Stacked Ensemble Learning-Reference-Cited by-同舟云学术

Recognition of Cross-Language Acoustic Emotional Valence Using Stacked Ensemble Learning

Published:2020-09-27 Issue:10 Volume:13 Page:246
ISSN:1999-4893
Container-title:Algorithms
language:en
Short-container-title:Algorithms

Author:

Zvarevashe Kudakwashe^ORCID,Olugbara Oludayo O.^ORCID

Abstract

Most of the studies on speech emotion recognition have used single-language corpora, but little research has been done in cross-language valence speech emotion recognition. Research has shown that the models developed for single-language speech recognition systems perform poorly when used in different environments. Cross-language speech recognition is a craving alternative, but it is highly challenging because the corpora used will have been recorded in different environments and under varying conditions. The differences in the quality of recording devices, elicitation techniques, languages, and accents of speakers make the recognition task even more arduous. In this paper, we propose a stacked ensemble learning algorithm to recognize valence emotion in a cross-language speech environment. The proposed ensemble algorithm was developed from random decision forest, AdaBoost, logistic regression, and gradient boosting machine and is therefore called RALOG. In addition, we propose feature scaling using random forest recursive feature elimination and a feature selection algorithm to boost the performance of RALOG. The algorithm has been evaluated against four widely used ensemble algorithms to appraise its performance. The amalgam of five benchmarked corpora has resulted in a cross-language corpus to validate the performance of RALOG trained with the selected acoustic features. The comparative analysis results have shown that RALOG gave better performance than the other ensemble learning algorithms investigated in this study.

Publisher

MDPI AG

Subject

Computational Mathematics,Computational Theory and Mathematics,Numerical Analysis,Theoretical Computer Science

Link

https://www.mdpi.com/1999-4893/13/10/246/pdf

Reference56 articles.

1. Automating skin disease diagnosis using image classification;Okuboyejo;World Congr. Eng. Comput. Sci.,2013

2. Attention embedded residual CNN for disease detection in tomato leaves;Karthik;Appl. Soft Comput. J.,2019

3. Speech Emotion Recognition with Heterogeneous Feature Unification of Deep Neural Network

4. Emotion recognition with speech for call centres using LPC and spectral analysis;Ram;Int. J. Adv. Comput. Res.,2013

Cited by 7 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Assessing the effectiveness of ensembles in Speech Emotion Recognition: Performance analysis under challenging scenarios;Expert Systems with Applications;2024-06

2. Chinese Emotionality in Chinese Emic Concepts and its Relevance for Discourse - Influences from Ecology, Thought Systems and Folk Religion;Culture & Psychology;2023-07-24

3. Time-Distributed Attention-Layered Convolution Neural Network with Ensemble Learning using Random Forest Classifier for Speech Emotion Recognition;Journal of Information and Communication Technology;2023

4. A speech corpus of Quechua Collao for automatic dimensional emotion recognition;Scientific Data;2022-12-24

5. Classification of Rice varieties using DMLP-PCA inspired features with MVE Classifier;2022 1st Zimbabwe Conference of Information and Communication Technologies (ZCICT);2022-11-09