Enhanced Deep Learning Hybrid Model of CNN Based on Spatial Transformer Network for Facial Expression Recognition-Reference-Cited by-同舟云学术

Enhanced Deep Learning Hybrid Model of CNN Based on Spatial Transformer Network for Facial Expression Recognition

Published:2022-10-29 Issue:14 Volume:36 Page:
ISSN:0218-0014
Container-title:International Journal of Pattern Recognition and Artificial Intelligence
language:en
Short-container-title:Int. J. Patt. Recogn. Artif. Intell.

Author:

Khan Nizamuddin¹^ORCID,Singh Ajay Vikram¹,Agrawal Rajeev²

Affiliation:

1. Amity Institute of Information Technology, Amity University, Noida 201303, Uttar Pradesh, India

2. Lloyd Institute of Engineering & Technology, Greater Noida 201306, Uttar Pradesh, India

Abstract

One of the most common approaches through which people communicate is facial expressions. A large number of features documented in the literature were created by hand, with the goal of overcoming specific challenges such as occlusions, scale, and illumination variations. These classic methods are then applied to a dataset of facial images or frames in order to train a classifier. The majority of these studies perform admirably on datasets of images shot in a controlled environment, but they struggle with more difficult datasets (FER-2013) that have higher image variation and partial faces. The nonuniform features of the human face as well as changes in lighting, shadows, facial posture, and direction are the key obstacles. Techniques of deep learning have been studied as a set of methodologies for gaining scalability and robustness on new forms of data. In this paper, we look at how well-known deep learning techniques (e.g. GoogLeNet, AlexNet) perform when it comes to facial expression identification, and propose an enhanced hybrid deep learning model based on STN for facial emotion recognition, which gives the best feature extraction and classification in one go and maximizes the accuracy for a large number of samples on FERG, JAFFE, FER-2013, and CK+ datasets. It is capable of focusing on the main parts of the face and attaining extensive development over preceding fashions on the FERG, JAFFE, CK+ datasets, and the more challenging one namely FER-2013.

Publisher

World Scientific Pub Co Pte Ltd

Subject

Artificial Intelligence,Computer Vision and Pattern Recognition,Software

Link

https://www.worldscientific.com/doi/pdf/10.1142/S0218001422520280

Reference54 articles.

1. Emotion Recognition in Children with Autism Spectrum Disorders: Relations to Eye Gaze and Autonomic State

2. Fear-type emotion recognition for future audio-based surveillance systems

Cited by 4 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Optimized hybrid deep learning pipelines for processing heterogeneous facial expression datasets;Measurement: Sensors;2024-02

2. Novel Approach of Facial Expression Recognition for Cross-Datasets;2023 14th International Conference on Computing Communication and Networking Technologies (ICCCNT);2023-07-06

3. Enhancing Feature Extraction Technique Through Spatial Deep Learning Model for Facial Emotion Detection;Annals of Emerging Technologies in Computing;2023-04-01

4. Attentional Deep Learning novel approach for Facial Expression Recognition;2023 6th International Conference on Information Systems and Computer Networks (ISCON);2023-03-03