Emotion recognition for human–computer interaction using high-level descriptors-Reference-Cited by-同舟云学术

Emotion recognition for human–computer interaction using high-level descriptors

Published:2024-05-27 Issue:1 Volume:14 Page:
ISSN:2045-2322
Container-title:Scientific Reports
language:en
Short-container-title:Sci Rep

Author:

Singla Chaitanya,Singh Sukhdev,Sharma Preeti,Mittal Nitin,Gared Fikreselam

Abstract

AbstractRecent research has focused extensively on employing Deep Learning (DL) techniques, particularly Convolutional Neural Networks (CNN), for Speech Emotion Recognition (SER). This study addresses the burgeoning interest in leveraging DL for SER, specifically focusing on Punjabi language speakers. The paper presents a novel approach to constructing and preprocessing a labeled speech corpus using diverse social media sources. By utilizing spectrograms as the primary feature representation, the proposed algorithm effectively learns discriminative patterns for emotion recognition. The method is evaluated on a custom dataset derived from various Punjabi media sources, including films and web series. Results demonstrate that the proposed approach achieves an accuracy of 69%, surpassing traditional methods like decision trees, Naïve Bayes, and random forests, which achieved accuracies of 49%, 52%, and 61% respectively. Thus, the proposed method improves accuracy in recognizing emotions from Punjabi speech signals.

Publisher

Springer Science and Business Media LLC

Link

https://www.nature.com/articles/s41598-024-59294-y.pdf

Reference42 articles.

1. Rout, J. K. et al. A model for sentiment and emotion analysis of unstructured social media text. Electron. Commer. Res. 18(1), 181–199. https://doi.org/10.1007/s10660-017-9257-8 (2017).

2. Ayata, D., Yaslan, Y. & Kamasak, M. E. Emotion recognition from multimodal physiological signals for emotion aware healthcare systems. J. Med. Biol. Eng. 40(2), 149–157 (2020).

3. Dong, Z., Wei, J., Chen, X. & Zheng, P. Face detection in security monitoring based on artificial intelligence video retrieval technology. IEEE Access 8, 63421–63433 (2020).

4. Xu, Z. et al. Social sensors based online attention computing of public safety events. IEEE Trans. Emerg. Top. Comput. 5(3), 403–411. https://doi.org/10.1109/tetc.2017.2684819 (2017).

5. Ekman, P. An argument for basic emotions. Cogn. Emotion 6(3/4), 169–200 (1992).

Cited by 1 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. MRSAPose: Multi-level routing sparse attention for multi-person pose estimation;Expert Systems with Applications;2024-12