Non-speech emotion recognition based on back propagation feed forward networks-Reference-Cited by-同舟云学术

Non-speech emotion recognition based on back propagation feed forward networks

Published:2024-04-18 Issue:4 Volume:46 Page:11067-11077
ISSN:1064-1246
Container-title:Journal of Intelligent & Fuzzy Systems
language:
Short-container-title:IFS

Author:

Zhang Xiwen¹,Xiao Hui¹

Affiliation:

1. Control Science and Engineering, Tongji University, Shanghai, China

Abstract

Non-speech emotion recognition involves identifying emotions conveyed through non-verbal vocalizations such as laughter, crying, and other sound signals, which play a crucial role in emotional expression and transmission. This paper employs a nine-category discrete emotion model encompassing happy, sad, angry, peaceful, fearful, loving, hateful, brave, and neutral. A proprietary non-speech dataset comprising 2337 instances was utilized, with 384-dimensional feature vectors extracted. The traditional Backpropagation Neural Network (BPNN) algorithm achieved a recognition rate of 87.7% on the non-speech dataset. In contrast, the proposed Whale Optimization Algorithm - Backpropagation Neural Network (WOA-BPNN) algorithm, applied to a self-made non-speech dataset, demonstrated a remarkable accuracy of 98.6%. Notably, even without facial emotional cues, non-speech sounds effectively convey dynamic information, and the proposed algorithm excels in their recognition. The study underscores the importance of non-speech emotional signals in communication, especially with the continuous advancement of artificial intelligence technology. The abstract thus encapsulates the paper’s focus on leveraging AI algorithms for high-precision non-speech emotion recognition.

Publisher

IOS Press

Reference14 articles.

1. Discrimination of speech from nonspeech based on multiscale spectro-temporal Modulations

2. A Deep Learning Method Using Gender-Specific Features for Emotion Recognition

3. Speech emotion recognition using data augmentation;Praseetha;International Journal of Speech Technology,2022

4. Mind the gap: On the value of silence representations to lexical-based speech emotion recognition;Perez;Proc. Interspeech,2022

5. Gender identification in a two-level hierarchical speech emotion recognition system for an Italian Social Robot;Guerrieri;Sensors,2022