A Video Sequence Face Expression Recognition Method Based on Squeeze-and-Excitation and 3DPCA Network

Author:

Li ChangORCID,Wen Chenglin,Qiu Yiting

Abstract

Expression recognition is a very important direction for computers to understand human emotions and human-computer interaction. However, for 3D data such as video sequences, the complex structure of traditional convolutional neural networks, which stretch the input 3D data into vectors, not only leads to a dimensional explosion, but also fails to retain structural information in 3D space, simultaneously leading to an increase in computational cost and a lower accuracy rate of expression recognition. This paper proposes a video sequence face expression recognition method based on Squeeze-and-Excitation and 3DPCA Network (SE-3DPCANet). The introduction of a 3DPCA algorithm in the convolution layer directly constructs tensor convolution kernels to extract the dynamic expression features of video sequences from the spatial and temporal dimensions, without weighting the convolution kernels of adjacent frames by shared weights. Squeeze-and-Excitation Network is introduced in the feature encoding layer, to automatically learn the weights of local channel features in the tensor features, thus increasing the representation capability of the model and further improving recognition accuracy. The proposed method is validated on three video face expression datasets. Comparisons were made with other common expression recognition methods, achieving higher recognition rates while significantly reducing the time required for training.

Funder

National Natural Science Foundation of China

State Key Laboratory of Rail Traffic Control and Safety

Beijing Jiaotong University

Publisher

MDPI AG

Subject

Electrical and Electronic Engineering,Biochemistry,Instrumentation,Atomic and Molecular Physics, and Optics,Analytical Chemistry

Reference57 articles.

1. Sown, M. (1978, January 7–10). A preliminary note on pattern recognition of facial emotional expression. Proceedings of the 4th International Joint Conferences on Pattern Recognition, Kyoto, Japan.

2. Jain, A.K., and Li, S.Z. (2011). Handbook of Face Recognition, Springer.

3. Facial expression recognition based on local binary patterns: A comprehensive study;Shan;Image Vis. Comput.,2009

4. Facial expression recognition from video sequences: Temporal and static modeling;Cohen;Comput. Vis. Image Underst.,2003

5. Chibelushi, C.C., and Bourel, F. (2003). Facial expression recognition: A brief tutorial overview. CVonline-Line Compend. Comput. Vis., 9.

Cited by 4 articles. 订阅此论文施引文献 订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献

1. ViTCN: Hybrid Vision Transformer with Temporal Convolution for Multi-Emotion Recognition;International Journal of Computational Intelligence Systems;2024-03-27

2. A novel weighted deep convolution model – African vultures optimization algorithm for an automated facial emotion recognition system;Multimedia Tools and Applications;2023-11-28

3. Segmentation of Skeleton Ultrasound Images Based on MMA-SUISNet;2023 IEEE International Conference on Mechatronics and Automation (ICMA);2023-08-06

4. Enhanced spatio-temporal 3D CNN for facial expression classification in videos;Multimedia Tools and Applications;2023-06-28

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3