Affiliation:
1. Nanjing Tech University
2. Airport office of Wuxi Customs Nanjing Customs
3. Northwestern Polytechnical University
Abstract
Abstract
Face expression recognition(FER) is an important research branch in the field of the computer vision neighborhood. Three prevalent problems in FER tasks that severely impact recognition rates are inter-class similarity, intra-class differences, and facial occlusion issues. Although there have been studies that address some of these issues, none of them can adequately address all three issues in a unified framework. In this paper, we propose a novel dual-branch structure of enhanced relation-aware attention and cross-feature fusion transformer network to comprehensively solve all three issues. Specifically, we design the Enhanced Relation-Aware Attention module to maximize the exploration of more local expression features. At the same time, the Transformer Perceptual Encoder module is adopted to establishing the contextual relationship between individual patches under global information. This greatly alleviates the inter-class similarity problem and the facial occlusion and facial pose transformation problems. On the basis of a dual branch structure, we extract facial image features using facial landmarks features to guide them and design Cross-Feature Fusion Transformer module to deeply cross-fuse two different semantic features. Experiments are performed and results show that our method can greatly alleviated intra-class difference problem with comparison of several traditional methods on three commonly used datasets.
Publisher
Research Square Platform LLC
Reference45 articles.
1. F. Noroozi, C. A. Corneanu, D. Kami´nska, T. Sapi´nski, S. Escalera, and G. Anbarjafari, “Survey on emotional body gesture recognition,” IEEE Trans. Affect. Comput., vol. 12, no. 2, pp. 505–523, Apr. 2021.
2. L. He, C. Guo, P. Tiwari, H. M. Pandey, and W. Dang, “Intelligent system for depression scale estimation with facial expressions and case study in industrial intelligence,” Int. J. Intell. Syst., pp. 1–18, Apr. 2021, doi: 10.1002/int.22426.
3. Triplet loss with multi-stage outlier suppression and class-pair margins for facial expression recognition;Xie W;IEEE Trans. Circuits Syst. Video Technol.,2022
4. Amir Hossein Farzaneh and Xiaojun Qi. Facial expression recognition in the wild via deep attentive center loss. In Proceedings of the IEEE/CVF Winter Conference on Applications ofComputer Vision, pages 2402–2411, 2021. 2, 7
5. Deep multi-path convolutional neural network joint with salient region attention for facial expression recognition;Siyue Xie Haifeng;Pattern Recognition,2019