Author:
Iskhakova A.O.,Wolf D.A.,Galin R.R.,Mamchenko M.V.
Abstract
The article proposes an original convolutional neural network (CNN) for solving the problem of the automatic voice-based assessment of a person’s emotional state. Key principles of such CNNs, and state-of-theart approaches to their design are described. A model of one-dimensional (1-D) CNN based on the human’s inner ear structure is presented. According to the given classification estimates, the proposed CNN model is regarded to be not worse than the known analogues. The linguistic robustness of the given CNN is confirmed; its key advantages in intelligent socio-cyberphysical systems is discussed. The applicability of the developed CNN for solving the problem of voice-based identification of human’s destructive emotions is characterized by the probability of 72.75%.
Reference24 articles.
1. Methods for Determination of Psychophysiological Condition of User Within Smart Environment Based on Complex Analysis of Heterogeneous Data
2. Malov D and Shumskaya 2019 2019 1st International Conference on Control Systems, Mathematical Modelling, Automation and Energy Efficiency (SUMMA) 556
3. Haq S and Jackson P J 2010 Multimodal emotion recognition. Machine audition: principles, algorithms and systems (UK: Loughborough University) p 398
4. Han K, Yu D and Tashev I 2014 INTERSPEECH 2014 223
5. Haq S, Jackson P J B and Edge J D 2008 Proc. Int’l Conf. on Auditory-Visual Speech Processing 185
Cited by
1 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献