DivaTrack: Diverse Bodies and Motions from Acceleration‐Enhanced Three‐Point Trackers-Reference-Cited by-同舟云学术

DivaTrack: Diverse Bodies and Motions from Acceleration‐Enhanced Three‐Point Trackers

Published:2024-04-27 Issue:2 Volume:43 Page:
ISSN:0167-7055
Container-title:Computer Graphics Forum
language:en
Short-container-title:Computer Graphics Forum

Author:

Yang Dongseok¹^ORCID,Kang Jiho¹^ORCID,Ma Lingni²^ORCID,Greer Joseph²^ORCID,Ye Yuting²^ORCID,Lee Sung‐Hee¹^ORCID

Affiliation:

1. Korea Advanced Institue of Science and Technology

2. Meta Reality Labs

Abstract

AbstractFull‐body avatar presence is important for immersive social and environmental interactions in digital reality. However, current devices only provide three six degrees of freedom (DOF) poses from the headset and two controllers (i.e. three‐point trackers). Because it is a highly under‐constrained problem, inferring full‐body pose from these inputs is challenging, especially when supporting the full range of body proportions and use cases represented by the general population. In this paper, we propose a deep learning framework, DivaTrack, which outperforms existing methods when applied to diverse body sizes and activities. We augment the sparse three‐point inputs with linear accelerations from Inertial Measurement Units (IMU) to improve foot contact prediction. We then condition the otherwise ambiguous lower‐body pose with the predictions of foot contact and upper‐body pose in a two‐stage model. We further stabilize the inferred full‐body pose in a wide range of configurations by learning to blend predictions that are computed in two reference frames, each of which is designed for different types of motions. We demonstrate the effectiveness of our design on a large dataset that captures 22 subjects performing challenging locomotion for three‐point tracking, including lunges, hula‐hooping, and sitting. As shown in a live demo using the Meta VR headset and Xsens IMUs, our method runs in real‐time while accurately tracking a user's motion when they perform a diverse set of movements.

Publisher

Wiley

Link

https://onlinelibrary.wiley.com/doi/pdf/10.1111/cgf.15057

Reference81 articles.

1. Aliakbarian Sadegh Cameron Pashmina Bogo Federica et al. “FLAG: Flow-based 3D Avatar Generation from Sparse Observations”.Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition.20222.

2. CoolMoves

3. Akada Hiroyasu Wang Jian Shimada Soshi et al. “UnrealEgo: A New Dataset for Robust Egocentric 3D Human Motion Capture”.European Conference on Computer Vision (ECCV).20222.

4. Chung Junyoung Gulcehre Caglar Cho KyungHyun andBengio Yoshua. “Empirical evaluation of gated recurrent neural networks on sequence modeling”.arXiv preprint arXiv:1412.3555(2014) 3.