A CNN Model for Human Parsing Based on Capacity Optimization-Reference-Cited by-同舟云学术

A CNN Model for Human Parsing Based on Capacity Optimization

Published:2019-03-29 Issue:7 Volume:9 Page:1330
ISSN:2076-3417
Container-title:Applied Sciences
language:en
Short-container-title:Applied Sciences

Author:

Jiang Yalong,Chi Zheru

Abstract

Although a state-of-the-art performance has been achieved in pixel-specific tasks, such as saliency prediction and depth estimation, convolutional neural networks (CNNs) still perform unsatisfactorily in human parsing where semantic information of detailed regions needs to be perceived under the influences of variations in viewpoints, poses, and occlusions. In this paper, we propose to improve the robustness of human parsing modules by introducing a depth-estimation module. A novel scheme is proposed for the integration of a depth-estimation module and a human-parsing module. The robustness of the overall model is improved with the automatically obtained depth labels. As another major concern, the computational efficiency is also discussed. Our proposed human parsing module with 24 layers can achieve a similar performance as the baseline CNN model with over 100 layers. The number of parameters in the overall model is less than that in the baseline model. Furthermore, we propose to reduce the computational burden by replacing a conventional CNN layer with a stack of simplified sub-layers to further reduce the overall number of trainable parameters. Experimental results show that the integration of two modules contributes to the improvement of human parsing without additional human labeling. The proposed model outperforms the benchmark solutions and the capacity of our model is better matched to the complexity of the task.

Funder

Natural Science Foundation of China

Publisher

MDPI AG

Subject

Fluid Flow and Transfer Processes,Computer Science Applications,Process Chemistry and Technology,General Engineering,Instrumentation,General Materials Science

Link

https://www.mdpi.com/2076-3417/9/7/1330/pdf

Reference63 articles.

1. The Pascal Visual Object Classes Challenge: A Retrospective

Cited by 4 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. JPPF: Multi-task Fusion for Consistent Panoptic-Part Segmentation;SN Computer Science;2024-01-10

2. Part-aware Panoptic Segmentation;2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR);2021-06

3. Instance Level Human Parts Detection Using Artificial Neural Networks and Deep Learning;Journal of Physics: Conference Series;2021-05-01

4. Special Features on Intelligent Imaging and Analysis;Applied Sciences;2019-11-10