Learning Physically Simulated Tennis Skills from Broadcast Videos

Published: 2023-07-26 Issue: 4 Volume: 42 Page: 1-14
ISSN: 0730-0301
Container-title: ACM Transactions on Graphics
Language: en
Short-container-title: ACM Trans. Graph.

Author:

Zhang Haotian¹, Yuan Ye², Makoviychuk Viktor², Guo Yunrong³, Fidler Sanja³⁴, Peng Xue Bin⁵⁶, Fatahalian Kayvon¹

Affiliation:

1. Stanford University, Stanford, United States of America

2. NVIDIA, Santa Clara, United States of America

3. NVIDIA, Toronto, Canada

4. University of Toronto, Toronto, Canada

5. NVIDIA, Vancouver, Canada

6. Simon Fraser University, Vancouver, Canada

Abstract

We present a system that learns diverse, physically simulated tennis skills from large-scale demonstrations of tennis play harvested from broadcast videos. Our approach is built upon hierarchical models, combining a low-level imitation policy and a high-level motion planning policy to steer the character in a motion embedding learned from broadcast videos. When deployed at scale on large video collections that encompass a vast set of examples of real-world tennis play, our approach can learn complex tennis shotmaking skills and realistically chain together multiple shots into extended rallies, using only simple rewards and without explicit annotations of stroke types. To address the low quality of motions extracted from broadcast videos, we correct estimated motion with physics-based imitation, and use a hybrid control policy that overrides erroneous aspects of the learned motion embedding with corrections predicted by the high-level policy. We demonstrate that our system produces controllers for physically simulated tennis players that can hit the incoming ball to target positions accurately using a diverse array of strokes (serves, forehands, and backhands), spins (topspins and slices), and playing styles (one/two-handed backhands, left/right-handed play). Overall, our system can synthesize two physically simulated characters playing extended tennis rallies with simulated racket and ball dynamics. Code and data for this work are available at https://research.nvidia.com/labs/toronto-ai/vid2player3d/.

Publisher

Association for Computing Machinery (ACM)

Subject

Computer Graphics and Computer-Aided Design

Link

https://dl.acm.org/doi/pdf/10.1145/3592408

Reference: 81 articles.

1. Exploiting Temporal Context for 3D Human Pose Estimation in the Wild

2. Samy Bengio, Oriol Vinyals, Navdeep Jaitly, and Noam Shazeer. 2015. Scheduled sampling for sequence prediction with recurrent neural networks. Advances in neural information processing systems 28 (2015).

3. DReCon

4. Alexey Bochkovskiy, Chien-Yao Wang, and Hong-Yuan Mark Liao. 2020. Yolov4: Optimal speed and accuracy of object detection. arXiv preprint arXiv:2004.10934 (2020).

5. Keep It SMPL: Automatic Estimation of 3D Human Pose and Shape from a Single Image

Cited by 13 articles.

1. Learning Prehensile Dexterity by Imitating and Emulating State-Only Observations;IEEE Robotics and Automation Letters;2024-10

2. MoConVQ: Unified Physics-Based Motion Control via Scalable Discrete Representations;ACM Transactions on Graphics;2024-07-19

3. Strategy and Skill Learning for Physics-based Table Tennis Animation;Special Interest Group on Computer Graphics and Interactive Techniques Conference Conference Papers '24;2024-07-13

4. Lifelike agility and play in quadrupedal robots using reinforcement learning and generative pre-trained models;Nature Machine Intelligence;2024-07-05

5. Multi-Stage Contrastive Regression for Action Quality Assessment;ICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP);2024-04-14