Affiliation:
1. School of Electronics and Information Technology, Sun Yat-Sen University, Guangzhou, 510006, Guangdong, People’s Republic of China
2. Guangzhou Metro Design & Research Institute Co., Ltd, Guangzhou, Guangdong, People’s Republic of China
Abstract
Recently, great progress has been achieved on facial landmark detection based on convolutional neural network, while it is still challenging due to partial occlusion and extreme head pose. In this paper, we propose a
Cascaded Structure-Learning Network (CSLN)
with using adversarial training to improve the performance of 2D facial landmark detection by taking the structure of facial landmarks into account. In the first stage, we improve the original stacked hourglass network, which applies a multi-branch module to capture different scales of features, a progressive convolution structure to compensate for the missing structural features in hourglass networks, and a pyramid inception structure to expand the receptive field. Specially, by introducing a discriminator, we use the adversarial training strategy to urge the improved hourglass network for generating more accurate heatmaps. The second stage, which is based on attention mechanism, optimizes the spatial correlations between different facial landmarks by reusing the structural features. Moreover, we propose a novel region loss, which can adaptively allocate proper weights to different regions. In this way, the network can focus more on those occluded landmarks. The experimental results on several datasets,
i.e.
300W, COFW, and AFLW, show that our proposed method achieves superior performance compared with the state-of-the-art methods.
Funder
National Natural Science Foundation of China
Natural Science Foundation of Guangdong Province
Science and Technology Program of Huizhou of China
Publisher
Association for Computing Machinery (ACM)
Subject
Computer Networks and Communications,Hardware and Architecture
Reference69 articles.
1. Localizing Parts of Faces Using a Consensus of Exemplars
2. BEGAN: Boundary equilibrium generative adversarial networks;Berthelot David;CoRR,2017
3. Adrian Bulat and Georgios Tzimiropoulos. 2017. Binarized convolutional landmark localizers for human pose estimation and face alignment with limited resources. In ICCV. 3726–3734.
4. Robust Face Landmark Estimation under Occlusion
5. Face Alignment by Explicit Shape Regression
Cited by
3 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献