Abstract
Emotion recognition through facial expressions and non-verbal speech is an important area of affective computing. Both modalities have been studied extensively, from classical feature extraction techniques to more recent deep learning approaches. However, most of these approaches face two major challenges: (1) robustness: can a model still make correct predictions when its input is degraded, for example by noise? and (2) cross-dataset generalisation: when a model is trained on one dataset, can it make inferences on another? To address these challenges directly, we first propose the application of a spiking neural network (SNN) to predicting emotional states from facial expression and speech data, then investigate and compare its accuracy under data degradation and on unseen new inputs. We evaluate our approach on third-party, publicly available datasets and compare it to state-of-the-art techniques. Our approach is robust to noise: it achieves an accuracy of 56.2% for facial expression recognition (FER), compared with 22.64% for a CNN and 14.10% for an SVM, when input images are degraded with a noise intensity of 0.5, and a top accuracy of 74.3% for speech emotion recognition (SER), compared with 21.95% for a CNN and 14.75% for an SVM, when white noise is applied to the audio. For generalisation, our approach achieves consistently high accuracies of 89% for FER and 70% for SER in cross-dataset evaluation, suggesting that it learns more effective feature representations that generalise well across subjects' facial features and vocal characteristics.
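To make the degradation setting concrete, the sketch below shows one plausible way to corrupt inputs at the quoted levels: salt-and-pepper noise at intensity 0.5 for images, and additive white Gaussian noise at a target SNR for audio. The noise models, function names, and the SNR parameterisation are illustrative assumptions; the abstract does not specify the exact degradation procedure used in the paper.

```python
import numpy as np

# Hypothetical degradation helpers for a robustness evaluation of the kind
# described above. The salt-and-pepper model for images and the SNR-based
# white-noise model for audio are assumptions, not the paper's stated method.

def degrade_image(img, intensity=0.5, rng=None):
    """Corrupt a greyscale image in [0, 1] with salt-and-pepper noise.

    `intensity` is the fraction of pixels replaced; 0.5 matches the
    heaviest degradation level quoted in the abstract.
    """
    rng = rng or np.random.default_rng()
    out = img.astype(float)
    mask = rng.random(img.shape) < intensity               # pixels to corrupt
    out[mask] = rng.integers(0, 2, size=int(mask.sum()))   # salt (1) or pepper (0)
    return out

def degrade_audio(wave, snr_db=0.0, rng=None):
    """Add white Gaussian noise to a waveform at a target SNR in dB."""
    rng = rng or np.random.default_rng()
    signal_power = np.mean(wave ** 2)
    noise_power = signal_power / (10.0 ** (snr_db / 10.0))
    noise = rng.normal(0.0, np.sqrt(noise_power), size=wave.shape)
    return wave + noise
```

A robustness comparison like the one summarised above would then be obtained by sweeping `intensity` (or `snr_db`) and re-measuring each model's accuracy on the degraded inputs.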
Publisher
Springer Science and Business Media LLC
Subject
Geometry and Topology, Theoretical Computer Science, Software
Cited by
12 articles.