Cascaded Structure-Learning Network with Using Adversarial Training for Robust Facial Landmark Detection-Reference-Cited by-同舟云学术

Cascaded Structure-Learning Network with Using Adversarial Training for Robust Facial Landmark Detection

Published:2022-02-16 Issue:2 Volume:18 Page:1-20
ISSN:1551-6857
Container-title:ACM Transactions on Multimedia Computing, Communications, and Applications
language:en
Short-container-title:ACM Trans. Multimedia Comput. Commun. Appl.

Author:

Feng Shenming¹,Nong Xingzhong²,Hu Haifeng¹

Affiliation:

1. School of Electronics and Information Technology, Sun Yat-Sen University, Guangzhou, 510006, Guangdong, People’s Republic of China

2. Guangzhou Metro Design & Research Institute Co., Ltd, Guangzhou, Guangdong, People’s Republic of China

Abstract

Recently, great progress has been achieved on facial landmark detection based on convolutional neural network, while it is still challenging due to partial occlusion and extreme head pose. In this paper, we propose a Cascaded Structure-Learning Network (CSLN) with using adversarial training to improve the performance of 2D facial landmark detection by taking the structure of facial landmarks into account. In the first stage, we improve the original stacked hourglass network, which applies a multi-branch module to capture different scales of features, a progressive convolution structure to compensate for the missing structural features in hourglass networks, and a pyramid inception structure to expand the receptive field. Specially, by introducing a discriminator, we use the adversarial training strategy to urge the improved hourglass network for generating more accurate heatmaps. The second stage, which is based on attention mechanism, optimizes the spatial correlations between different facial landmarks by reusing the structural features. Moreover, we propose a novel region loss, which can adaptively allocate proper weights to different regions. In this way, the network can focus more on those occluded landmarks. The experimental results on several datasets, i.e. 300W, COFW, and AFLW, show that our proposed method achieves superior performance compared with the state-of-the-art methods.

Funder

National Natural Science Foundation of China

Natural Science Foundation of Guangdong Province

Science and Technology Program of Huizhou of China

Publisher

Association for Computing Machinery (ACM)

Subject

Computer Networks and Communications,Hardware and Architecture

Link

https://dl.acm.org/doi/pdf/10.1145/3474595

Reference69 articles.

1. Localizing Parts of Faces Using a Consensus of Exemplars

2. BEGAN: Boundary equilibrium generative adversarial networks;Berthelot David;CoRR,2017

3. Adrian Bulat and Georgios Tzimiropoulos. 2017. Binarized convolutional landmark localizers for human pose estimation and face alignment with limited resources. In ICCV. 3726–3734.

4. Robust Face Landmark Estimation under Occlusion

5. Face Alignment by Explicit Shape Regression

Cited by 3 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Unsupervised Adversarial Example Detection of Vision Transformers for Trustworthy Edge Computing;ACM Transactions on Multimedia Computing, Communications, and Applications;2024-07-02

2. Local eye-net: An attention based deep learning architecture for localization of eyes;Expert Systems with Applications;2024-04

3. Face Identification Based on Active Facial Patches Using Multi-Task Cascaded Convolutional Networks;Journal of Advances in Information Technology;2024