Human-Computer Interaction with Detection of Speaker Emotions Using Convolution Neural Networks-Reference-Cited by-同舟云学术

Human-Computer Interaction with Detection of Speaker Emotions Using Convolution Neural Networks

Published:2022-03-31 Issue: Volume:2022 Page:1-16
ISSN:1687-5273
Container-title:Computational Intelligence and Neuroscience
language:en
Short-container-title:Computational Intelligence and Neuroscience

Author:

Alnuaim Abeer Ali¹^ORCID,Zakariah Mohammed²^ORCID,Alhadlaq Aseel¹,Shashidhar Chitra³,Hatamleh Wesam Atef⁴,Tarazi Hussam⁵,Shukla Prashant Kumar⁶,Ratna Rajnish⁷^ORCID

Affiliation:

1. Department of Computer Science and Engineering, College of Applied Studies and Community Services, King Saud University, P.O. BOX 22459, Riyadh 11495, Saudi Arabia

2. College of Computer and Information Sciences, King Saud University, P.O. Box 51178, Riyadh 11543, Saudi Arabia

3. Department of Commerce and Management, Seshadripuram College, Seshadripuram, Bengaluru-20, India

4. Department of Computer Science, College of Computer and Information Sciences, King Saud University, P.O. Box 51178, Riyadh 11543, Saudi Arabia

5. Department of Computer Science and Informatics, School of Engineering and Computer Science, Oakland University, 318 Meadow Brook Rd, Rochester MI 48309, USA

6. Department of Computer Science and Engineering, Koneru Lakshmaiah Education Foundation, Vaddeswaram, Guntur 522502, Andhra Pradesh, India

7. Gedu College of Business Studies, Royal University of Bhutan, Gedu, Bhutan

Abstract

Emotions play an essential role in human relationships, and many real-time applications rely on interpreting the speaker’s emotion from their words. Speech emotion recognition (SER) modules aid human-computer interface (HCI) applications, but they are challenging to implement because of the lack of balanced data for training and clarity about which features are sufficient for categorization. This research discusses the impact of the classification approach, identifying the most appropriate combination of features and data augmentation on speech emotion detection accuracy. Selection of the correct combination of handcrafted features with the classifier plays an integral part in reducing computation complexity. The suggested classification model, a 1D convolutional neural network (1D CNN), outperforms traditional machine learning approaches in classification. Unlike most earlier studies, which examined emotions primarily through a single language lens, our analysis looks at numerous language data sets. With the most discriminating features and data augmentation, our technique achieves 97.09%, 96.44%, and 83.33% accuracy for the BAVED, ANAD, and SAVEE data sets, respectively.

Funder

King Saud University

Publisher

Hindawi Limited

Subject

General Mathematics,General Medicine,General Neuroscience,General Computer Science

Link

http://downloads.hindawi.com/journals/cin/2022/7463091.pdf

Reference67 articles.

1. A survey of affect recognition methods: audio, visual, and spontaneous expressions;Z. Zeng;IEEE Transactions on Pattern Analysis and Machine Intelligence,2008

2. Automatic recognition of emotions from speech: a review of the literature and recommendations for practical realisation;T. Vogt,2008

3. Context-sensitive multimodal emotion recognition from speech and facial expression using bidirectional lstm modeling;M. Wöllmer

4. Discrete Wavelet Transforms and Artificial Neural Networks for Speech Emotion Recognition

5. A review of speech-based bimodal recognition

Cited by 49 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Speech emotion recognition in real static and dynamic human-robot interaction scenarios;Computer Speech & Language;2025-01

2. Multimedia Human-Computer Interaction Method in Video Animation Based on Artificial Intelligence Technology;International Journal of Information Technology and Web Engineering;2024-05-24

3. Enhancing Speech Emotion Recognition through a Cross-Dataset Analysis: Exploring Improved Models;2024 IEEE 4th International Maghreb Meeting of the Conference on Sciences and Techniques of Automatic Control and Computer Engineering (MI-STA);2024-05-19

4. Propagation Dynamics and Connectivity Assurance in Sensor Radio Networks;2024 International Conference on Communication, Computer Sciences and Engineering (IC3SE);2024-05-09

5. Impact of social media on the evolution of English semantics through linguistic analysis;Forum for Linguistic Studies;2024-03-20