Enhanced Deep Learning Hybrid Model of CNN Based on Spatial Transformer Network for Facial Expression Recognition-Reference-Cited by-同舟云学术

Enhanced Deep Learning Hybrid Model of CNN Based on Spatial Transformer Network for Facial Expression Recognition

Published:2022-10-29 Issue:14 Volume:36 Page:
ISSN:0218-0014
Container-title:International Journal of Pattern Recognition and Artificial Intelligence
language:en
Short-container-title:Int. J. Patt. Recogn. Artif. Intell.

Author:

Khan Nizamuddin¹^ORCID,Singh Ajay Vikram¹,Agrawal Rajeev²

Affiliation:

1. Amity Institute of Information Technology, Amity University, Noida 201303, Uttar Pradesh, India

2. Lloyd Institute of Engineering & Technology, Greater Noida 201306, Uttar Pradesh, India

Abstract

One of the most common approaches through which people communicate is facial expressions. A large number of features documented in the literature were created by hand, with the goal of overcoming specific challenges such as occlusions, scale, and illumination variations. These classic methods are then applied to a dataset of facial images or frames in order to train a classifier. The majority of these studies perform admirably on datasets of images shot in a controlled environment, but they struggle with more difficult datasets (FER-2013) that have higher image variation and partial faces. The nonuniform features of the human face as well as changes in lighting, shadows, facial posture, and direction are the key obstacles. Techniques of deep learning have been studied as a set of methodologies for gaining scalability and robustness on new forms of data. In this paper, we look at how well-known deep learning techniques (e.g. GoogLeNet, AlexNet) perform when it comes to facial expression identification, and propose an enhanced hybrid deep learning model based on STN for facial emotion recognition, which gives the best feature extraction and classification in one go and maximizes the accuracy for a large number of samples on FERG, JAFFE, FER-2013, and CK+ datasets. It is capable of focusing on the main parts of the face and attaining extensive development over preceding fashions on the FERG, JAFFE, CK+ datasets, and the more challenging one namely FER-2013.

Publisher

World Scientific Pub Co Pte Ltd

Subject

Artificial Intelligence,Computer Vision and Pattern Recognition,Software

Link

https://www.worldscientific.com/doi/pdf/10.1142/S0218001422520280

Reference54 articles.

1. Emotion Recognition in Children with Autism Spectrum Disorders: Relations to Eye Gaze and Autonomic State

2. Fear-type emotion recognition for future audio-based surveillance systems

Cited by 8 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. A Transformer Warning Method Based on Pattern Recognition and Statistical Analysis;2024 IEEE 6th Advanced Information Management, Communicates, Electronic and Automation Control Conference (IMCEC);2024-05-24

2. Optimized hybrid deep learning pipelines for processing heterogeneous facial expression datasets;Measurement: Sensors;2024-02

3. Investigating the Use of Spatial Transformer Networks and Recurrent Neural Networks for Medical Image Segmentation;2024 International Conference on Optimization Computing and Wireless Communication (ICOCWC);2024-01-29

4. A Business Process for Detecting Facial Movements and Emotions Using Deep Learning Techniques;2023 International Conference on Electrical, Communication and Computer Engineering (ICECCE);2023-12-30

5. Facial Expression Recognition Based on Dual Scale Hybrid Attention Mechanism;2023 5th International Conference on Control and Robotics (ICCR);2023-11-23