Affiliation:
1. Dharamsinh Desai University
Abstract
The utilization of emotion detection and recognition technologies has revolutionized human-computer interactions in various fields such as sentiment analysis, health monitoring, education, and automotive interfaces. Previously, traditional systems relied on single-channel affect sensing, which limited their ability to capture the complexity of human emotions. However, humans naturally combine multiple cues such as facial expressions, speech, gestures, and contextual factors when expressing their emotions. As a result, there has been a growing interest in multi-modal emotion frameworks that integrate different sensory streams to obtain more comprehensive emotion assessments. These holistic perspectives allow for the capture of nuanced affective information that would otherwise be difficult to represent. In this survey paper, we delve into the latest advancements in emotion recognition systems, examining fusion techniques, feature engineering methods, and classification architectures that leverage inputs from various modalities such as vision, audio, and text. Our focus is to showcase innovative interventions throughout the entire pipeline, from preprocessing raw signals to predicting emotion labels, in order to enable robust multi-modal analysis.
Through detailed theoretical discussions and practical case studies, this paper aims to inspire further research by providing insights into the current state-of-the-art, highlighting open challenges, and exploring promising avenues in emotion detection through cross-modal learning.
Publisher
Research Square Platform LLC