Speech Emotion Recognition Based on Swin-Transformer

Published:2023-05-01 Issue:1 Volume:2508 Page:012056
ISSN:1742-6588
Container-title:Journal of Physics: Conference Series
Short-container-title:J. Phys.: Conf. Ser.

Author:

Liao Zirou, Shen Shaoping

Abstract

The ability of machines to understand subjective human emotions is an essential step toward realizing artificial intelligence. Extracting and utilizing information from audio signals remains a challenging task. By transforming acoustic signals into time-frequency information represented by spectrograms, advanced algorithms from computer vision can be applied to acoustics. In this paper, we propose a Speech Emotion Recognition (SER) system based on Swin-Transformer (Swin). In addition to verifying the feasibility of Swin for the SER task, we compare the effectiveness of various spectrogram representations under the same model parameters. Our model is validated on the IEMOCAP dataset and achieves competitive performance.
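The step of turning a waveform into a spectrogram that an image model can consume can be illustrated with a minimal sketch. This is not the authors' feature pipeline: the function name `log_spectrogram`, the 16 kHz sampling rate, the 25 ms Hann window (400 samples), and the 10 ms hop (160 samples) are all assumptions chosen for illustration, and the record does not state which spectrogram variants the paper actually compares.

```python
import numpy as np


def log_spectrogram(signal, frame_len=400, hop=160, eps=1e-10):
    """Return a log-magnitude spectrogram of shape (n_freq_bins, n_frames).

    frame_len and hop are illustrative defaults (25 ms and 10 ms at an
    assumed 16 kHz sampling rate), not values taken from the paper.
    """
    signal = np.asarray(signal, dtype=np.float64)
    if signal.size < frame_len:
        # Zero-pad short inputs so that at least one frame exists.
        signal = np.pad(signal, (0, frame_len - signal.size))
    n_frames = 1 + (signal.size - frame_len) // hop
    window = np.hanning(frame_len)
    # Slice the waveform into overlapping frames and apply the window.
    frames = np.stack(
        [signal[i * hop : i * hop + frame_len] * window for i in range(n_frames)]
    )
    # Magnitude of the one-sided FFT of each frame.
    magnitude = np.abs(np.fft.rfft(frames, axis=1))
    # Log compression in decibels; transpose to (frequency, time).
    return 20.0 * np.log10(magnitude + eps).T


# Usage: one second of a 1 kHz tone at the assumed 16 kHz sampling rate.
sr = 16000
t = np.arange(sr) / sr
tone = np.sin(2 * np.pi * 1000 * t)
spec = log_spectrogram(tone)
print(spec.shape)
```

The resulting 2-D array has frequency on one axis and time on the other, so it can be resized or tiled like a single-channel image before being passed to a vision backbone such as a Swin Transformer.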

Publisher

IOP Publishing

Subject

Computer Science Applications, History, Education

Link

https://iopscience.iop.org/article/10.1088/1742-6596/2508/1/012056/pdf

Reference: 22 articles.

1. Mental Illness Disorder Diagnosis Using Emotion Variation Detection from Continuous English Speech [J];Lalitha;CMC-Computers Materials & Continua,2021

2. A Review of Automated Speech and Language Features for Assessment of Cognitive and Thought Disorders [J];Voleti;IEEE Journal of Selected Topics in Signal Processing,2020

3. A Comprehensive Review of Speech Emotion Recognition Systems [J];Wani;IEEE Access,2021

4. Speech Emotion Recognition from 3D Log-Mel Spectrograms With Deep Learning Network [J];Meng;IEEE Access,2019

Cited by 3 articles.

1. Speech emotion recognition using the novel SwinEmoNet (Shifted Window Transformer Emotion Network);International Journal of Speech Technology;2024-07-10

2. Speech emotion recognition via graph-based representations;Scientific Reports;2024-02-23

3. SENet-based speech emotion recognition using synthesis-style transfer data augmentation;International Journal of Speech Technology;2023-12