HDFormer: High-order Directed Transformer for 3D Human Pose Estimation-Reference-Cited by-同舟云学术

HDFormer: High-order Directed Transformer for 3D Human Pose Estimation

Published:2023-08 Issue: Volume: Page:
ISSN:
Container-title:Proceedings of the Thirty-Second International Joint Conference on Artificial Intelligence
language:
Short-container-title:

Author:

Chen Hanyuan¹,He Jun-Yan¹,Xiang Wangmeng¹,Cheng Zhi-Qi²,Liu Wei¹,Liu Hanbing³,Luo Bin¹,Geng Yifeng¹,Xie Xuansong¹

Affiliation:

1. Alibaba Group

2. Carnegie Mellon University

3. Tsinghua University

Abstract

Human pose estimation is a challenging task due to its structured data sequence nature. Existing methods primarily focus on pair-wise interaction of body joints, which is insufficient for scenarios involving overlapping joints and rapidly changing poses. To overcome these issues, we introduce a novel approach, the High-order Directed Transformer (HDFormer), which leverages high-order bone and joint relationships for improved pose estimation. Specifically, HDFormer incorporates both self-attention and high-order attention to formulate a multi-order attention module. This module facilitates first-order "joint-joint", second-order "bone-joint", and high-order "hyperbone-joint" interactions, effectively addressing issues in complex and occlusion-heavy situations. In addition, modern CNN techniques are integrated into the transformer-based architecture, balancing the trade-off between performance and efficiency. HDFormer significantly outperforms state-of-the-art (SOTA) models on Human3.6M and MPI-INF-3DHP datasets, requiring only 1/10 of the parameters and significantly lower computational costs. Moreover, HDFormer demonstrates broad real-world applicability, enabling real-time, accurate 3D pose estimation. The source code is in https://github.com/hyer/HDFormer.

Publisher

International Joint Conferences on Artificial Intelligence Organization

Cited by 1 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Human-Object-Object Interaction: Towards Human-Centric Complex Interaction Detection;Proceedings of the 31st ACM International Conference on Multimedia;2023-10-26