Speech Emotion Recognition Using ANFIS and PSO-optimization With Word2Vec-Reference-Cited by-同舟云学术

Speech Emotion Recognition Using ANFIS and PSO-optimization With Word2Vec

Published:2022-12-19 Issue: Volume: Page:
ISSN:
Container-title:
language:
Short-container-title:

Author:

rezaie vahid¹^ORCID,Parnianifard Amir²,Rodriguez Demostenes Zegarra³,Mumtaz Shahid⁴,Wuttisittikulkij Lunchakorn²

Affiliation:

1. Yazd University

2. Chulalongkorn University Faculty of Engineering

3. Federal University of Lavras: Universidade Federal de Lavras

4. Teleton Santiago Institute: Instituto Teleton Santiago

Abstract

Abstract Speech Emotion Recognition (SER) plays a vital role in human-computer interaction as an important branch of affective computing. Due to inconsistencies in the data and challenging signal extraction, in this paper, we propose a novel emotion recognition method based on the combination of Adaptive Neuro-Fuzzy Inference System (ANFIS) and Particle Swarm Optimization (PSO) with Word to Vector (Word2Vec) models. To begin, the inputs have been pre-processed, which comprise audio and text data. Second, the features were extracted using the Word2vec behind spectral and prosodic approaches. Finally, the features are selected using the Sequential Backward Floating Selection (SBFS) approach. In the end, the ANFIS-PSO model has been used to recognize speech emotion. A performance evaluation of the proposed algorithm is carried out on Sharif Emotional Speech Database (ShEMO). The experimental results show that the proposed algorithm has advantages in accuracy, reaching 0.873 and 0.752 in males and females, respectively, in comparison with the CNNs and SVM, MLP, RF models.

Publisher

Research Square Platform LLC

Reference50 articles.

1. EEG emotion recognition using fusion model of graph convolutional neural networks and LSTM;Yin Y;Appl Soft Comput,2021

2. Li J, Deng L, Haeb-Umbach R, Gong Y (2016) Fundamentals of speech recognition. ” in Robust Automatic Speech Recognition. Elsevier, pp 9–40

3. Emotion recognition of speech signal using Taylor series and deep belief network based classification;Valiyavalappil Haridas A;Evol Intell,2020

4. Speech emotion recognition: Emotional models, databases, features, pre-processing methods, supporting modalities, and classifiers;Akçay MB;Speech Commun,2020

5. Soofi A, Awan A (2017) “Classification Techniques in Machine Learning: Applications and Issues,” J. Basic Appl. Sci., vol. 13, no. August, pp. 459–465, doi: 10.6000/1927-5129.2017.13.76