Bimodal Emotion Recognition using Machine Learning
-
Published:2021-04-30
Issue:4
Volume:10
Page:189-194
-
ISSN:2249-8958
-
Container-title:International Journal of Engineering and Advanced Technology
-
language:en
-
Short-container-title:IJEAT
Author:
S* Manisha1, Nafisa H Saida2, Gopal Nandita1, Anand Roshni P1
Affiliation:
1. Department of CSE, Sri Sivasubramaniya Nadar College of Engineering, Chennai, India. 2. , Department of CSE, Sri Sivasubramaniya Nadar College of Engineering, Chennai, India.
Abstract
The predominant communication channel to convey
relevant and high impact information is the emotions that is
embedded on our communications. Researchers have tried to
exploit these emotions in recent years for human robot
interactions (HRI) and human computer interactions (HCI).
Emotion recognition through speech or through facial expression
is termed as single mode emotion recognition. The rate of
accuracy of these single mode emotion recognitions are improved
using the proposed bimodal method by combining the modalities
of speech and facing and recognition of emotions using a
Convolutional Neural Network (CNN) model. In this paper, the
proposed bimodal emotion recognition system, contains three
major parts such as processing of audio, processing of video and
fusion of data for detecting the emotion of a person. The fusion of
visual information and audio data obtained from two different
channels enhances the emotion recognition rate by providing the
complementary data. The proposed method aims to classify 7
basic emotions (anger, disgust, fear, happy, neutral, sad, surprise)
from an input video. We take audio and image frame from the
video input to predict the final emotion of a person. The dataset
used is an audio-visual dataset uniquely suited for the study of
multi-modal emotion expression and perception. Dataset used
here is RAVDESS dataset which contains audio-visual dataset,
visual dataset and audio dataset. For bimodal emotion detection
the audio-visual dataset is used.
Publisher
Blue Eyes Intelligence Engineering and Sciences Engineering and Sciences Publication - BEIESP
Subject
Computer Science Applications,General Engineering,Environmental Engineering
Reference17 articles.
1. Livingstone, S.R. and Russo, F.A, 2018, "The Ryerson Audio-Visual Database of Emotional Speech and Song (RAVDESS): A dynamic, multimodal set of facial and vocal expressions in North American English", PloS one, 13(5). 2. Yadav, S. and Shukla, S, 2016, "Analysis of k-Fold Cross-Validation over Hold-Out Validation on Colossal Datasets for Quality Classification", 2016, IEEE 6th International Conference on Advanced Computing (IACC), Bhimavaram, pp. 78-83. 3. Pinto, A., Pereira, S., Dinis, H., Silva, C. A. and Rasteiro, D. M. L. D., 2015, "Random decision forests for automatic brain tumor segmentation on multimodal MRI images", 2015 IEEE 4th Portuguese Meeting on Bioengineering (ENBENG), Porto, pp. 1-5. 4. Arriaga, O., Valdenegro-Toro, M. and Ploger, P., 2017. "Real-time convolutional neural networks for emotion and gender classification,", arXiv preprint arXiv: 1710.07557. 5. Wang, K., An, N., Li, B.N., Zhang, Y. and Li, L., 2015, "Speech emotion recognition using Fourier parameters", IEEE Transactions on affective computing, 6(1), pp.69-75
Cited by
2 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献
|
|