SHELF: Combination of Shape Fitting and Heatmap Regression for Landmark Detection in Human Face-Reference-Cited by-同舟云学术

SHELF: Combination of Shape Fitting and Heatmap Regression for Landmark Detection in Human Face

Published:2023-09-26 Issue:3 Volume:10 Page:e3
ISSN:2410-0218
Container-title:EAI Endorsed Transactions on Industrial Networks and Intelligent Systems
language:
Short-container-title:EAI Endorsed Trans Ind Net Intel Syst

Author:

Quyen Ngo Thi Ngoc,Linh Tran Duy,Phuc Vu Hong,Nam Nguyen Van

Abstract

Today, facial emotion recognition is widely adopted in many intelligent applications including the driver monitoring system, the smart customer care as well as the e-learning system. In fact, the human emotions can be well represented by facial landmarks which are hard to be detected from images, due to the high number of discrete landmarks, the variation of shapes and poses of the human face in real world. Over decades, many methods have been proposed for facial landmark detection including the shape fitting, the coordinate regression such as ASMNet and AnchorFace. However, their performance is still limited for real-time applications in terms of both accuracy and efficiency. In this paper, we propose a novel method called SHELF which is the first to combine the shape fitting and heatmap regression approaches for landmark detection in human face. The heatmap model aims to generate the landmarks that fit to the common shapes. The method has been evaluated on three datasets 300W-Challenging, WFLW, 300VW-E with 31557 images and achieved a normalized mean error (NME) of 6.67% , 7.34%, 12.55% correspondingly, which overcomes most existing methods. For the first two datasets, the method is also comparable to the state of the art AnchorFace with a NME of 6.19%, 4.62%, respectively.

Publisher

European Alliance for Innovation n.o.

Subject

Computer Networks and Communications,Computer Science Applications,Information Systems,Control and Systems Engineering

Reference37 articles.

1. Nam, N.V. and Quyen, N.T.N. (2023) Flash: Facial landmark detection using active shape model and heatmap regression. In The 9th EAI International Conference on Industrial Networks and Intelligent Systems.

2. He, K., Zhang, X., Ren, S. and Sun, J. (2016) Deep Residual Learning for Image Recognition. In Proceedings of 2016 IEEE Conference on Computer Vision and Pattern Recognition, CVPR ’16 (IEEE): 770–778. doi:10.1109/CVPR.2016.90, URL http://ieeexplore. ieee.org/document/7780459.

3. Tan, M. and Le, Q.V. (2019) Efficientnet: Rethinking model scaling for convolutional neural networks. In Chaudhuri, K. and Salakhutdinov, R. [eds.] Proceedings of the 36th International Conference on Machine Learning, ICML 2019, 9-15 June 2019, Long Beach, California, USA (PMLR), Proceedings of Machine Learning Research 97: 6105–6114. URL http://proceedings. mlr.press/v97/tan19a.html.

4. Sandler, M., Howard, A.G., Zhu, M., Zhmoginov, A. and Chen, L. (2018) Mobilenetv2: Inverted residuals and linear bottlenecks. In 2018 IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2018, Salt Lake City, UT, USA, June 18-22, 2018 (Computer Vision Foundation / IEEE Computer Society): 4510– 4520. doi:10.1109/CVPR.2018.00474.

5. Ma, N., Zhang, X., Zheng, H.T. and Sun, J. (2018) Shufflenet v2: Practical guidelines for efficient cnn architecture design. In Ferrari, V., Hebert, M., Sminchisescu, C. and Weiss, Y. [eds.] Computer Vision – ECCV 2018 (Cham: Springer International Publishing): 122–138.