Facial Micro-Expression Recognition Enhanced by Score Fusion and a Hybrid Model from Convolutional LSTM and Vision Transformer-Reference-Cited by-同舟云学术

Facial Micro-Expression Recognition Enhanced by Score Fusion and a Hybrid Model from Convolutional LSTM and Vision Transformer

Published:2023-06-16 Issue:12 Volume:23 Page:5650
ISSN:1424-8220
Container-title:Sensors
language:en
Short-container-title:Sensors

Author:

Zheng Yufeng¹,Blasch Erik²^ORCID

Affiliation:

1. Department of Data Science, University of Mississippi Medical Center, Jackson, MS 39216, USA

2. MOVEJ Analytics, Fairborn, OH 45324, USA

Abstract

In the billions of faces that are shaped by thousands of different cultures and ethnicities, one thing remains universal: the way emotions are expressed. To take the next step in human–machine interactions, a machine (e.g., a humanoid robot) must be able to clarify facial emotions. Allowing systems to recognize micro-expressions affords the machine a deeper dive into a person’s true feelings, which will take human emotion into account while making optimal decisions. For instance, these machines will be able to detect dangerous situations, alert caregivers to challenges, and provide appropriate responses. Micro-expressions are involuntary and transient facial expressions capable of revealing genuine emotions. We propose a new hybrid neural network (NN) model capable of micro-expression recognition in real-time applications. Several NN models are first compared in this study. Then, a hybrid NN model is created by combining a convolutional neural network (CNN), a recurrent neural network (RNN, e.g., long short-term memory (LSTM)), and a vision transformer. The CNN can extract spatial features (within a neighborhood of an image), whereas the LSTM can summarize temporal features. In addition, a transformer with an attention mechanism can capture sparse spatial relations residing in an image or between frames in a video clip. The inputs of the model are short facial videos, while the outputs are the micro-expressions recognized from the videos. The NN models are trained and tested with publicly available facial micro-expression datasets to recognize different micro-expressions (e.g., happiness, fear, anger, surprise, disgust, sadness). Score fusion and improvement metrics are also presented in our experiments. The results of our proposed models are compared with that of literature-reported methods tested on the same datasets. The proposed hybrid model performs the best, where score fusion can dramatically increase recognition performance.

Publisher

MDPI AG

Subject

Electrical and Electronic Engineering,Biochemistry,Instrumentation,Atomic and Molecular Physics, and Optics,Analytical Chemistry

Link

https://www.mdpi.com/1424-8220/23/12/5650/pdf

Reference36 articles.

1. Darwin, deception, and facial expression;Ekman;Ann. N. Y. Acad. Sci.,2003

2. Zhang, L., and Arandjelović, O. (2021). Review of Automatic Microexpression Recognition in the Past Decade. Mach. Learn. Knowl. Extr., 3.

3. Constants across cultures in the face and emotion;Ekman;J. Pers. Soc. Psychol.,1971

4. Ekman, P. (2009). The Philosophy of Deception, Oxford University Press.

5. Nonverbal leakage and clues to deception;Ekman;Psychiatry,1969

Cited by 3 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Facial and speech Emotional Recognition based on Improved Deep Model;2024-03-01

2. Optimized hybrid deep learning pipelines for processing heterogeneous facial expression datasets;Measurement: Sensors;2024-02

3. Multimodal Gait Abnormality Recognition Using a Convolutional Neural Network–Bidirectional Long Short-Term Memory (CNN-BiLSTM) Network Based on Multi-Sensor Data Fusion;Sensors;2023-11-10