Affiliation:
1. North China University of Technology Beijing China
Abstract
At present, emotion recognition has become a research hotspot in the field of pattern recognition. Considering the problems of incomplete information and strong interference in single‐modal emotion recognition, multimodal emotion recognition has been widely studied. Multimodal data includes, but is not limited to, emoji, text, and voice modality data. There are various ways to express emotion, among which expression, text and voice are the most direct and reliable emotional information carriers. Therefore, it is of great research and practical significance to comprehensively consider the emotion recognition research of expression, text and voice modalities, and to apply its research results to the field of virtual reality (referred to as VR).This paper analyzes the relevant situation of multimodal emotion recognition, extracts features of voice, text and expression, and then fuses them into multimodal for emotional analysis, and applies it to the VR field. The main work content is as follows: the relevant technologies of multimodal emotion recognition research in the field of VR are introduced, including deep learning related technologies, virtual reality technology, and multimodal fusion methods. In terms of deep learning, the focus is on convolutional neural networks and recurrent neural networks and their variants. In terms of virtual reality technology, the characteristics and applications of virtual reality are introduced. In terms of multimodal fusion, three commonly used fusion methods are introduced.
Reference15 articles.
1. Communication without words[J];Rogier A.M.;Tijdschrift Voor Ziekenverpleging,1971
2. Dual-modal emotion recognition based on facial expressions and speech [J];Jingjie Yan;Journal of Nanjing University of Posts and Telecommunications (Natural Science Edition),2018
3. Fusion of audio and video information for multi modal person authentication
4. Learning Effective Features With a Hybrid Deep Model for Audio - Visual Emotion Recognition[J];Shiqing Zhang;IEEE Transactions on Circuits and Systems for Video Technology,2018