Neural Categorical Priors for Physics-Based Character Control-Reference-Cited by-同舟云学术

Neural Categorical Priors for Physics-Based Character Control

Published:2023-12-05 Issue:6 Volume:42 Page:1-16
ISSN:0730-0301
Container-title:ACM Transactions on Graphics
language:en
Short-container-title:ACM Trans. Graph.

Author:

Zhu Qingxu¹,Zhang He¹,Lan Mengting²,Han Lei¹

Affiliation:

1. Tencent Robotics X, China

2. XVERSE, China

Abstract

Recent advances in learning reusable motion priors have demonstrated their effectiveness in generating naturalistic behaviors. In this paper, we propose a new learning framework in this paradigm for controlling physics-based characters with improved motion quality and diversity over existing methods. The proposed method uses reinforcement learning (RL) to initially track and imitate life-like movements from unstructured motion clips using the discrete information bottleneck, as adopted in the Vector Quantized Variational AutoEncoder (VQ-VAE). This structure compresses the most relevant information from the motion clips into a compact yet informative latent space, i.e., a discrete space over vector quantized codes. By sampling codes in the space from a trained categorical prior distribution, high-quality life-like behaviors can be generated, similar to the usage of VQ-VAE in computer vision. Although this prior distribution can be trained with the supervision of the encoder's output, it follows the original motion clip distribution in the dataset and could lead to imbalanced behaviors in our setting. To address the issue, we further propose a technique named prior shifting to adjust the prior distribution using curiosity-driven RL. The outcome distribution is demonstrated to offer sufficient behavioral diversity and significantly facilitates upper-level policy learning for downstream tasks. We conduct comprehensive experiments using humanoid characters on two challenging downstream tasks, sword-shield striking and two-player boxing game. Our results demonstrate that the proposed framework is capable of controlling the character to perform considerably high-quality movements in terms of behavioral strategies, diversity, and realism. Videos, codes, and data are available at https://tencent-roboticsx.github.io/NCP/.

Publisher

Association for Computing Machinery (ACM)

Subject

Computer Graphics and Computer-Aided Design

Link

https://dl.acm.org/doi/pdf/10.1145/3618397

Reference86 articles.

1. Learning dexterous in-hand manipulation

2. Marc Bellemare , Sriram Srinivasan , Georg Ostrovski , Tom Schaul , David Saxton , and Remi Munos . 2016. Unifying count-based exploration and intrinsic motivation. Advances in Neural Information Processing Systems 29 ( 2016 ). Marc Bellemare, Sriram Srinivasan, Georg Ostrovski, Tom Schaul, David Saxton, and Remi Munos. 2016. Unifying count-based exploration and intrinsic motivation. Advances in Neural Information Processing Systems 29 (2016).

3. DReCon

4. Christopher M Bishop and Nasser M Nasrabadi . 2006. Pattern Recognition and Machine Learning . Vol. 4 . Springer . Christopher M Bishop and Nasser M Nasrabadi. 2006. Pattern Recognition and Machine Learning. Vol. 4. Springer.

5. Samuel R Bowman , Luke Vilnis , Oriol Vinyals , Andrew M Dai , Rafal Jozefowicz , and Samy Bengio . 2015. Generating sentences from a continuous space. arXiv preprint arXiv:1511.06349 ( 2015 ). Samuel R Bowman, Luke Vilnis, Oriol Vinyals, Andrew M Dai, Rafal Jozefowicz, and Samy Bengio. 2015. Generating sentences from a continuous space. arXiv preprint arXiv:1511.06349 (2015).

Cited by 9 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Categorical Codebook Matching for Embodied Character Controllers;ACM Transactions on Graphics;2024-07-19

2. MoConVQ: Unified Physics-Based Motion Control via Scalable Discrete Representations;ACM Transactions on Graphics;2024-07-19

3. Physics-based Scene Layout Generation from Human Motion;Special Interest Group on Computer Graphics and Interactive Techniques Conference Conference Papers '24;2024-07-13

4. Strategy and Skill Learning for Physics-based Table Tennis Animation;Special Interest Group on Computer Graphics and Interactive Techniques Conference Conference Papers '24;2024-07-13

5. Lifelike agility and play in quadrupedal robots using reinforcement learning and generative pre-trained models;Nature Machine Intelligence;2024-07-05