Affiliation:
1. Department of Electrical Engineering, Ahar Branch, Islamic Azad University, Ahar, Iran
2. Faculty of Electrical Engineering, Urmia University of Technology, Urmia, Iran
Abstract
Traditional approaches to emotion recognition use unimodal physiological signals, and the effectiveness of such systems suffers from several limitations. To overcome them, this paper proposes a new method that extracts features from time-frequency maps of multimodal biological signals. First, electroencephalogram (EEG) and peripheral physiological signals (PPSs) are fused, and then the two-dimensional discrete orthonormal Stockwell transform (2D-DOST) of the multimodal signal matrix is computed to obtain time-frequency maps. A convolutional neural network (CNN) then extracts local deep features from the absolute value of the 2D-DOST output. Since some deep features are uninformative, a semisupervised dimension reduction (SSDR) scheme reduces the feature dimensionality while balancing generalization and discrimination. Finally, a classifier recognizes the emotion. A Bayesian optimizer selects the SSDR and classifier parameter values that maximize recognition accuracy. The performance of the proposed method is evaluated on the DEAP dataset in two- and four-class scenarios through extensive simulations. This dataset consists of EEG signals in 32 channels and PPSs in eight channels from 32 subjects. The proposed method achieves accuracies of 0.953 and 0.928 in the two- and four-class scenarios, respectively. The results indicate that multimodal signals detect emotions more effectively than unimodal signals, and that the proposed method outperforms recently introduced ones.
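For illustration only, the minimal Python sketch below mirrors the stages described above on synthetic data. Every component is a stand-in rather than the paper's implementation: a 2-D FFT magnitude replaces the 2D-DOST, a small untrained CNN replaces the trained feature extractor, PCA replaces the SSDR scheme, and a fixed-parameter SVM replaces the Bayesian-optimized classifier; all names, shapes, and parameters are assumptions.

```python
import numpy as np
import torch
import torch.nn as nn
from sklearn.decomposition import PCA
from sklearn.model_selection import train_test_split
from sklearn.svm import SVC

rng = np.random.default_rng(0)
n_trials, n_channels, n_samples = 120, 40, 128   # 32 EEG + 8 PPS channels fused

# Fused multimodal matrix per trial (channels x time); random toy data here.
X = rng.standard_normal((n_trials, n_channels, n_samples)).astype(np.float32)
y = rng.integers(0, 2, n_trials)                 # toy two-class labels

# Time-frequency map per trial: magnitude of a 2-D transform.
# The paper uses the 2D-DOST; np.fft.fft2 is only a runnable placeholder.
maps = np.abs(np.fft.fft2(X, axes=(1, 2))).astype(np.float32)

# Small CNN extracting local deep features from the maps (untrained here;
# the paper trains a CNN on the 2D-DOST magnitudes).
cnn = nn.Sequential(
    nn.Conv2d(1, 8, kernel_size=3, padding=1), nn.ReLU(),
    nn.MaxPool2d(2),
    nn.Conv2d(8, 16, kernel_size=3, padding=1), nn.ReLU(),
    nn.AdaptiveAvgPool2d((4, 4)), nn.Flatten(),
)
with torch.no_grad():
    feats = cnn(torch.from_numpy(maps).unsqueeze(1)).numpy()

# Dimension reduction and classification; PCA and a fixed-parameter SVM
# stand in for the SSDR scheme and the Bayesian-optimized classifier.
X_tr, X_te, y_tr, y_te = train_test_split(feats, y, random_state=0)
pca = PCA(n_components=20).fit(X_tr)
clf = SVC(kernel="rbf").fit(pca.transform(X_tr), y_tr)
print("toy accuracy:", clf.score(pca.transform(X_te), y_te))
```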
Subject
Artificial Intelligence, Human-Computer Interaction, Theoretical Computer Science, Software