Speech emotion classification using fractal dimension-based features

Published: 2019-09-26 Issue: 5 Volume: 24
ISSN: 2335-8963
Container-title: Nonlinear Analysis: Modelling and Control
Short-container-title: NAMC

Author:

Tamulevičius Gintautas, Karbauskaitė Rasa, Dzemyda Gintautas

Abstract

Over the last 10–20 years, many new ideas have been proposed to improve the accuracy of speech emotion recognition, e.g., effective feature sets, complex classification schemes, and multi-modal data acquisition. Nevertheless, speech emotion recognition remains a task with limited success. Considering the nonlinear and fluctuating nature of emotional speech, in this paper we present fractal dimension-based features for speech emotion classification. We employed Katz, Castiglioni, Higuchi, and Hurst exponent-based features and their statistical functionals to establish the 224-dimensional full feature set. The dimensionality was reduced by applying the Sequential Forward Selection technique. The results of the experimental study show a clear superiority of the fractal dimension-based feature sets over the acoustic ones. An average accuracy of 96.5% was obtained using the reduced feature sets. Feature selection enabled us to obtain 4-dimensional and 8-dimensional sets for Lithuanian and German emotions, respectively.
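
To illustrate the kind of feature the abstract refers to, the following is a minimal sketch of a Katz fractal dimension computed per frame and summarised by statistical functionals. It is not the authors' implementation: the amplitude-only form of the Katz estimate, the function names `katz_fd` and `frame_functionals`, the frame length of 400 samples, and the choice of functionals (mean, standard deviation, minimum, maximum) are all assumptions made for illustration.

```python
import math
import statistics


def katz_fd(x):
    """Katz fractal dimension of a 1-D sequence (amplitude-only variant, an assumption)."""
    steps = [abs(b - a) for a, b in zip(x, x[1:])]
    total_length = sum(steps)                    # L: total length of the curve
    n = len(steps)                               # number of steps along the curve
    extent = max(abs(v - x[0]) for v in x[1:])   # d: farthest distance from the first sample
    # Undefined for constant frames (L = 0) and when d equals the mean step size.
    return math.log10(n) / (math.log10(n) + math.log10(extent / total_length))


def frame_functionals(signal, frame_len=400):
    """Split the signal into non-overlapping frames and summarise the per-frame Katz FD.

    frame_len and the set of functionals are hypothetical choices.
    """
    frames = [signal[i:i + frame_len]
              for i in range(0, len(signal) - frame_len + 1, frame_len)]
    fds = [katz_fd(frame) for frame in frames]
    return {
        "mean": statistics.fmean(fds),
        "std": statistics.pstdev(fds),
        "min": min(fds),
        "max": max(fds),
    }
```

Under this variant a monotonic ramp has a dimension of exactly 1, while a signal that doubles back on itself scores higher. A feature selection step such as Sequential Forward Selection would then operate on vectors built from functionals like these.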

Publisher

Vilnius University Press

Subject

Applied Mathematics, Analysis

Cited by 11 articles.

1. Supervised Machine-Learning Methodology for Industrial Robot Positional Health Using Artificial Neural Networks, Discrete Wavelet Transform, and Nonlinear Indicators;Sensors;2023-03-17

2. Fractional-Order Calculus-Based Data Augmentation Methods for Environmental Sound Classification with Deep Learning;Fractal and Fractional;2022-09-29

3. THE EFFECT OF NOISE AND NONLINEAR NOISE REDUCTION METHODS ON THE FRACTAL DIMENSION OF CHAOTIC TIME SERIES;Fractals;2021-11-22

4. Attention guided 3D CNN-LSTM model for accurate speech based emotion recognition;Applied Acoustics;2021-11

5. Fractional Differential Equation-Based Instantaneous Frequency Estimation for Signal Reconstruction;Fractal and Fractional;2021-07-30