Affiliation:
1. School of Computer Science and Technology, Dalian University of Technology, Dalian 116000, China
2. Neusoft Reach Automotive Technology (Dalian) Co., Ltd., Dalian 116000, China
Abstract
In real-world scenarios, the facial expression recognition task faces several challenges, including lighting variations, image noise, face occlusion, and other factors, which limit the performance of existing models in dealing with complex situations. To cope with these problems, we introduce the CoT module between the CNN and ViT frameworks, which improves the ability to perceive subtle differences by learning the correlations between local area features at a fine-grained level, helping to maintain the consistency between the local area features and the global expression, and making the model more adaptable to complex lighting conditions. Meanwhile, we adopt an adaptive learning method to effectively eliminate the interference of noise and occlusion by dynamically adjusting the parameters of the Transformer Encoder’s self-attention weight matrix. Experiments demonstrate the accuracy of our CoT_AdaViT model in the Oulu-CASIA dataset as (NIR: 87.94%, VL: strong: 89.47%, weak: 84.76%, dark: 82.28%). As well as, CK+, RAF-DB, and FERPlus datasets achieved 99.20%, 91.07%, and 90.57% recognition results, which achieved excellent performance and verified that the model has strong recognition accuracy and robustness in complex scenes.
Reference55 articles.
1. The early development of emotion recognition in autistic children: Decoding basic emotions from facial expressions and emotion-provoking situations;Li;Underst. Expr. Interact.,1978
2. Munsif, M., Ullah, M., Ahmad, B., Sajjad, M., and Cheikh, F.A. (2022). Monitoring Neurological Disorder Patients via Deep Learning Based Facial Expressions Analysis, Springer.
3. Kabir, M.R., Dewan, M.A.A., and Lin, F. (2023, January 24–26). Lightweight model for emotion detection from facial expression in online learning. Proceedings of the 2023 IEEE Canadian Conference on Electrical and Computer Engineering (CCECE), Regina, SK, Canada.
4. The Effect of Synchrony of Happiness on Facial Expression of Negative Emotion When Lying;Solbu;J. Nonverbal Behav.,2023
5. An automated hyperparameter tuned deep learning model enabled facial emotion recognition for autonomous vehicle drivers;Jain;Image Vis. Comput.,2023
Cited by
3 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献