Lightweight Three-Dimensional Pose and Joint Center Estimation Model for Rehabilitation Therapy

Published: 2023-10-16
Volume: 12
Issue: 20
Page: 4273
ISSN: 2079-9292
Container-title: Electronics
Language: en
Short-container-title: Electronics

Author:

Kim Yeonggwang¹, Ku Giwon¹, Yang Chulseung¹, Lee Jeonggi¹, Kim Jinsul²

Affiliation:

1. Korea Electronics Technology Institute, Gwangju 61011, Republic of Korea

2. Department of ICT Convergence System Engineering, Chonnam National University, 77, Yongbong-ro, Buk-gu, Gwangju 500-757, Republic of Korea

Abstract

In this study, we propose a novel transformer-based model with independent tokens for estimating three-dimensional (3D) human pose and shape from monocular videos, with a specific focus on rehabilitation therapy. The main objective is to recover pixel-aligned, rehabilitation-customized 3D human poses and body shapes directly from monocular images or videos, which is a challenging task owing to the inherent ambiguity of monocular input. Existing human pose estimation methods rely heavily on an initialized mean pose and shape as prior estimates and regress parameters with iterative error feedback. Video-based approaches enhance single-frame features by modeling the overall changes in image-level features, but they still have difficulty capturing joint-level rotational motion and ensuring local temporal consistency. To address these limitations, we introduce two types of characterization tokens specifically designed for rehabilitation therapy: joint rotation tokens and camera tokens. These tokens progressively interact with the image features through the transformer layers and encode prior knowledge of human 3D joint rotations (i.e., position information derived from large-scale data). By updating these tokens, we can estimate the SMPL parameters for a given image. Furthermore, we incorporate a temporal model that captures the rotational temporal information of each joint, thereby reducing jitter in local body parts. The performance of our method is comparable to that of the current best-performing models. In addition, we present the structural differences among the models to create a pose classification model for rehabilitation. Using ResNet-50 and transformer architectures, our model achieves a PA-MPJPE of 49.0 mm on the 3DPW dataset.
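
For readers unfamiliar with the PA-MPJPE metric reported in the abstract: it is commonly computed by rigidly aligning the predicted joints to the ground truth (Procrustes alignment over rotation, uniform scale, and translation) and then averaging the per-joint Euclidean distances. The following is a minimal sketch of that conventional definition, assuming NumPy and non-degenerate joint sets; the function names `mpjpe`, `procrustes_align`, and `pa_mpjpe` are hypothetical, and this is not the authors' evaluation code.

```python
import numpy as np


def mpjpe(pred, gt):
    """Mean per-joint position error between two (J, 3) joint arrays."""
    pred = np.asarray(pred, dtype=float)
    gt = np.asarray(gt, dtype=float)
    return float(np.linalg.norm(pred - gt, axis=1).mean())


def procrustes_align(pred, gt):
    """Similarity-align pred to gt (rotation, uniform scale, translation).

    Assumes both inputs are (J, 3) arrays and that the joints are not
    degenerate (e.g. not all identical).
    """
    pred = np.asarray(pred, dtype=float)
    gt = np.asarray(gt, dtype=float)

    # Remove translation by centering both joint sets.
    mu_p, mu_g = pred.mean(axis=0), gt.mean(axis=0)
    p, g = pred - mu_p, gt - mu_g

    # Optimal rotation from the SVD of the 3x3 cross-covariance matrix.
    U, S, Vt = np.linalg.svd(p.T @ g)
    d = np.sign(np.linalg.det(Vt.T @ U.T))  # guard against reflections
    D = np.array([1.0, 1.0, d])
    R = Vt.T @ np.diag(D) @ U.T

    # Optimal uniform scale for the rotated, centered prediction.
    scale = (S * D).sum() / (p ** 2).sum()

    return scale * p @ R.T + mu_g


def pa_mpjpe(pred, gt):
    """Procrustes-aligned MPJPE, in the same units as the inputs."""
    return mpjpe(procrustes_align(pred, gt), gt)
```

Because the alignment removes global rotation, scale, and translation, a prediction that differs from the ground truth only by such a transform scores approximately zero under `pa_mpjpe`, whereas `mpjpe` penalizes it in full. Benchmark protocols also differ in joint sets and root alignment, which this sketch does not model.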

Funder

Ministry of Science and ICT (MSIT), Korea

Technology Commercialization Collaboration Platform Construction

Publisher

MDPI AG

Subject

Electrical and Electronic Engineering, Computer Networks and Communications, Hardware and Architecture, Signal Processing, Control and Systems Engineering

Link

https://www.mdpi.com/2079-9292/12/20/4273/pdf
