A Pattern Mining Approach for Improving Speech Emotion Recognition-Reference-Cited by-同舟云学术

A Pattern Mining Approach for Improving Speech Emotion Recognition

Published:2022-11 Issue:14 Volume:36 Page:
ISSN:0218-0014
Container-title:International Journal of Pattern Recognition and Artificial Intelligence
language:en
Short-container-title:Int. J. Patt. Recogn. Artif. Intell.

Author:

Avci Umut¹^ORCID

Affiliation:

1. Department of Software Engineering, Yasar University, Izmir, Turkey

Abstract

Speech-driven user interfaces are becoming more common in our lives. To interact with such systems naturally and effectively, machines need to recognize the emotional states of users and respond to them accordingly. At the heart of the emotion recognition research done to this end lies the emotion representation that enables machines to learn and predict emotions. Speech emotion recognition studies use a wide range of low-to-high-level acoustic features for representation purposes such as LLDs, their functionals, and BoAW. In this paper, we present a new method for extracting a novel set of high-level features for classifying emotions. For this purpose, we (1) reduce the dimension of discrete-time speech signals, (2) perform a quantization operation on the new signals and assign a distinct symbol to each quantization level, (3) use the symbol sequences representing the signals to extract discriminative patterns that are capable of distinguishing different emotions from each other, and (4) generate a separate set of features for each emotion from the extracted patterns. Experimental results show that pattern features outperform Energy, Voicing, MFCC, Spectral, and RASTA feature sets. We also demonstrate that combining the pattern-based features and the acoustic features further improves the classification performance.

Publisher

World Scientific Pub Co Pte Ltd

Subject

Artificial Intelligence,Computer Vision and Pattern Recognition,Software

Link

https://www.worldscientific.com/doi/pdf/10.1142/S0218001422500458

Reference69 articles.

1. Threat of Adversarial Attacks on Deep Learning in Computer Vision: A Survey

2. Lecture Notes in Computer Science;Avci U.,2019

Cited by 1 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Self-Labeling Learning Ensemble via Deep Recurrent Neural Network and Self-Representation for Speech Emotion Recognition;International Journal of Pattern Recognition and Artificial Intelligence;2024-06-22