Graph U-Shaped Network with Mapping-Aware Local Enhancement for Single-Frame 3D Human Pose Estimation

Published:2023-10-02 Issue:19 Volume:12 Page:4120
ISSN:2079-9292
Container-title:Electronics
language:en
Short-container-title:Electronics

Author:

Yu Bing¹, Huang Yan¹, Cheng Guang¹, Huang Dongjin¹, Ding Youdong¹

Affiliation:

1. Shanghai Film Academy, Shanghai University, Shanghai 200072, China

Abstract

2D-to-3D approaches to monocular single-frame 3D human pose estimation are challenged by noisy input and by a failure to capture long-range joint correlations, both of which lead to unreasonable predictions. To address this, we propose a straightforward but effective U-shaped network, the mapping-aware U-shaped graph convolutional network (M-UGCN), for single-frame applications. The network applies skeletal pooling/unpooling operations to expand the limited convolutional receptive field. For noisy inputs, because local nodes have direct access to the subtle discrepancies between poses, we define an additional mapping-aware local-enhancement mechanism that focuses on local node interactions across multiple scales. We evaluated the proposed method on the benchmark datasets Human3.6M and MPI-INF-3DHP, and the experimental results demonstrated the robustness of the M-UGCN against noisy inputs. Notably, the average error of the proposed method was 4.1% lower than that of state-of-the-art methods adopting similar multi-scale learning approaches.
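
The skeletal pooling/unpooling idea mentioned in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the 17-joint indexing, the five body-part groups in GROUPS, the mean-pooling rule, the copy-back unpooling rule, and the function names skeletal_pool and skeletal_unpool are all assumptions made for illustration. The sketch only shows how grouping joints into coarser nodes lets one coarse node summarize several joints, which is one way a graph convolution at the coarse level could reach beyond immediate neighbors.

```python
# Hypothetical grouping of a 17-joint skeleton into 5 body parts.
# The joint indices and the groups are illustrative assumptions,
# not the grouping used in the paper.
GROUPS = [
    [0, 7, 8, 9, 10],  # torso and head
    [1, 2, 3],         # one leg
    [4, 5, 6],         # other leg
    [11, 12, 13],      # one arm
    [14, 15, 16],      # other arm
]


def skeletal_pool(joints, groups=GROUPS):
    """Average the feature vectors of the joints in each group.

    joints: list of per-joint feature lists, one entry per joint.
    Returns one averaged feature list per group.
    """
    pooled = []
    for group in groups:
        members = [joints[j] for j in group]
        pooled.append([sum(col) / len(members) for col in zip(*members)])
    return pooled


def skeletal_unpool(pooled, groups=GROUPS, num_joints=17):
    """Copy each group's feature back to every joint in that group."""
    restored = [None] * num_joints
    for feature, group in zip(pooled, groups):
        for j in group:
            restored[j] = list(feature)
    return restored


# Toy 2D input: joint j has the feature [j, 2 * j].
joints = [[float(j), float(2 * j)] for j in range(17)]
coarse = skeletal_pool(joints)       # 5 coarse nodes
restored = skeletal_unpool(coarse)   # back to 17 joints
print(len(coarse), len(restored))
```

In a U-shaped design, pooling of this kind would sit on the downsampling path and unpooling on the upsampling path, with graph convolutions applied at each resolution; the averaging and copy-back rules here are only the simplest possible choices.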

Funder

Shanghai Natural Science Foundation

Shanghai Talent Development Funding

Publisher

MDPI AG

Subject

Electrical and Electronic Engineering, Computer Networks and Communications, Hardware and Architecture, Signal Processing, Control and Systems Engineering

Link

https://www.mdpi.com/2079-9292/12/19/4120/pdf

References: 58 articles.

1. Xu, T., and Takano, W. (2021, January 19–25). Graph stacked hourglass networks for 3D human pose estimation. Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, Virtual.

2. Zhao, W., Wang, W., and Tian, Y. (2022, January 18–24). GraFormer: Graph-oriented transformer for 3D pose estimation. Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, New Orleans, LA, USA.

3. Zou, Z., and Tang, W. (2021, January 11–17). Modulated graph convolutional network for 3D human pose estimation. Proceedings of the IEEE/CVF International Conference on Computer Vision, Virtual.

4. Zhao, W., and Wang, W. (2022). K-order graph-oriented transformer with GraAttention for 3D pose and shape estimation. arXiv.

5. Zou, Z., Liu, K., Wang, L., and Tang, W. (2020, January 22–25). High-order graph convolutional networks for 3D human pose estimation. Proceedings of the British Machine Vision Conference (BMVC), Virtual.