Lightweight Three-Dimensional Pose and Joint Center Estimation Model for Rehabilitation Therapy
-
Published:2023-10-16
Issue:20
Volume:12
Page:4273
-
ISSN:2079-9292
-
Container-title:Electronics
-
language:en
-
Short-container-title:Electronics
Author:
Kim Yeonggwang1, Ku Giwon1ORCID, Yang Chulseung1, Lee Jeonggi1, Kim Jinsul2ORCID
Affiliation:
1. Korea Electronics Technology Institute, Gwangju 61011, Republic of Korea 2. Department of ICT Convergence System Engineering, Chonnam National University, 77, Yongbong-ro, Buk-gu, Gwangju 500-757, Republic of Korea
Abstract
In this study, we proposed a novel transformer-based model with independent tokens for estimating three-dimensional (3D) human pose and shape from monocular videos, specifically focusing on its application in rehabilitation therapy. The main objective is to recover pixel-aligned rehabilitation-customized 3D human poses and body shapes directly from monocular images or videos, which is a challenging task owing to inherent ambiguity. Existing human pose estimation methods heavily rely on the initialized mean pose and shape as prior estimates and employ parameter regression with iterative error feedback. However, video-based approaches face difficulties capturing joint-level rotational motion and ensuring local temporal consistency despite enhancing single-frame features by modeling the overall changes in the image-level features. To address these limitations, we introduce two types of characterization tokens specifically designed for rehabilitation therapy: joint rotation and camera tokens. These tokens progressively interact with the image features through the transformer layers and encode prior knowledge of human 3D joint rotations (i.e., position information derived from large-scale data). By updating these tokens, we can estimate the SMPL parameters for a given image. Furthermore, we incorporate a temporal model that effectively captures the rotational temporal information of each joint, thereby reducing jitters in local parts. The performance of our method is comparable with those of the current best-performing models. In addition, we present the structural differences among the models to create a pose classification model for rehabilitation. We leveraged ResNet-50 and transformer architectures to achieve a remarkable PA-MPJPE of 49.0 mm for the 3DPW dataset.
Funder
Ministry of Science and ICT (MSIT), Korea Technology Commercialization Collaboration Platform Construction
Subject
Electrical and Electronic Engineering,Computer Networks and Communications,Hardware and Architecture,Signal Processing,Control and Systems Engineering
Reference53 articles.
1. Pooyandeh, M., Han, K.-J., and Sohn, I. (2022). Cybersecurity in the AI-Based Metaverse: A Survey. Appl. Sci., 12. 2. Development of metaverse for intelligent healthcare;Wang;Nat. Mach. Intell.,2022 3. Mozumder, M.A.I., Sheeraz, M.M., Athar, A., Aich, S., and Kim, H.C. (2022, January 13–16). Overview: Technology Roadmap of the Future Trend of Metaverse based on IoT, Blockchain, AI Technique, and Medical Domain Metaverse Activity. Proceedings of the 2022 24th International Conference on Advanced Communication Technology (ICACT), Pyeongchang-gun, Republich of Korea. 4. Augmented Reality, Artificial Intelligence, and the Re-Enchantment of the World: With Mohammad Yaqub Chaudhary, “Augmented Reality, Artificial Intelligence, and the Re-Enchantment of the World”; and William Young, “Reverend Robot: Automation and Clergy”;Chaudhary;Zygon,2019 5. Ali, S., Armand, T.P.T., Athar, A., Hussain, A., Ali, M., Yaseen, M., Joo, M.-I., and Kim, H.-C. (2023). Metaverse in Healthcare Integrated with Explainable AI and Blockchain: Enabling Immersiveness, Ensuring Trust, and Providing Patient Data Security. Sensors, 23.
|
|