Affiliation
1. State Key Laboratory of Software Development Environment, Beihang University, Beijing, China
2. IRIP Lab, School of Computer Science and Engineering, Beihang University, Beijing, China
3. LIRIS, Département Mathématiques Informatique, Ecole Centrale de Lyon, Ecully, France
Abstract
Facial Expression Recognition (FER) is one of the most important topics in computer vision and pattern recognition, and it has attracted increasing attention for its scientific challenges and application potential. In this article, we propose a novel and effective approach to FER using multi-modal two-dimensional (2D) and 3D videos, which encodes both static and dynamic clues by a scattering convolution network. First, a shape-based detection method is introduced to locate the start and the end of an expression in videos; segment its onset, apex, and offset states; and sample the important frames for emotion analysis. Second, the apex frames of 2D videos are represented by scattering, conveying static texture details. Those of 3D videos are processed in a similar way, but to highlight static shape details, several geometric maps derived from differential quantities of multiple orders, i.e., Normal Maps and Shape Index Maps, are generated as the input to scattering instead of the original smooth facial surfaces. Third, the average of neighboring samples centred at each key texture frame or shape map in the onset is computed, and the scattering features extracted from all the average samples of 2D and 3D videos are then concatenated to capture dynamic texture and shape cues, respectively. Finally, Multiple Kernel Learning is adopted to combine the features of the 2D and 3D modalities and compute similarities to predict the expression label. Thanks to the scattering descriptor, the proposed approach not only encodes distinct local texture and shape variations of different expressions, as several milestone operators such as SIFT and HOG do, but also captures subtle information hidden in high frequencies in both channels, which is crucial to better distinguish expressions that are easily confused. The validation is conducted on the BU-4DFE and BP-4D databases, and the accuracies reached are very competitive, indicating the competence of the proposed approach for this task.
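As an illustrative aside, the sketch below shows two of the pipeline's ingredients: scattering features for a single apex frame and a Shape Index Map built from per-pixel principal curvatures. It is a minimal sketch only; the paper does not publish an implementation, so the open-source kymatio library stands in for the scattering convolution network, and the scale J, the 128x128 input size, and the shape-index convention are illustrative assumptions rather than the authors' settings.

```python
# Illustrative sketch (not the authors' code): scattering features for an
# apex frame, and a Shape Index Map from per-pixel principal curvatures.
# kymatio stands in for the scattering convolution network; J, L, and the
# 128x128 input size are assumptions, not values from the paper.
import numpy as np
from kymatio.numpy import Scattering2D

def scattering_features(image, J=3, L=8):
    """Flattened scattering coefficients of a single-channel image."""
    h, w = image.shape
    scattering = Scattering2D(J=J, L=L, shape=(h, w))
    coeffs = scattering(image.astype(np.float32))  # shape (C, h/2^J, w/2^J)
    return coeffs.reshape(-1)                      # one feature vector per frame

def shape_index_map(k1, k2):
    """Koenderink's shape index from principal curvatures k1 >= k2:
    SI = (2/pi) * arctan((k1 + k2) / (k1 - k2)), in [-1, 1].
    arctan2 handles the umbilical case k1 == k2; some papers instead
    rescale SI to [0, 1], so the convention here is an assumption."""
    return (2.0 / np.pi) * np.arctan2(k1 + k2, k1 - k2)

# Toy usage: a random 128x128 "apex frame" and a synthetic curvature field.
frame = np.random.rand(128, 128)
texture_feat = scattering_features(frame)           # static 2D texture cue
si_map = shape_index_map(frame - 0.5, frame - 1.0)  # would itself feed scattering
```

In the pipeline described by the abstract, such per-frame vectors are averaged over neighborhoods of key onset frames to obtain the dynamic cues, and the 2D and 3D channels are then fused with Multiple Kernel Learning; a fixed-weight sum of per-channel kernels fed to a precomputed-kernel SVM would be the simplest stand-in for that last step.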
Funder
Research Program of State Key Laboratory of Software Development Environment
Partner University Foundation
PUF 4D Vision project
National Natural Science Foundation of China
Microsoft Research Asia Collaborative Program
French National Research Agency (ANR)
National Key Research and Development Plan
Jemime project
Publisher
Association for Computing Machinery (ACM)
Subject
Computer Networks and Communications, Hardware and Architecture
Cited by
29 articles.