Action-conditional implicit visual dynamics for deformable object manipulation-Reference-Cited by-同舟云学术

Action-conditional implicit visual dynamics for deformable object manipulation

Published:2023-07-28 Issue: Volume: Page:
ISSN:0278-3649
Container-title:The International Journal of Robotics Research
language:en
Short-container-title:The International Journal of Robotics Research

Author:

Shen Bokui¹^ORCID,Jiang Zhenyu²,Choy Christopher³,Savarese Silvio¹,Guibas Leonidas J.²,Anandkumar Anima³⁴,Zhu Yuke²³

Affiliation:

1. Computer Science Department, Stanford University, Stanford, CA, USA

2. Computer Science Department, The University of Texas at Austin, Austin, TX, USA

3. AI Algorithm Research, Nvidia Inc., Santa Clara, CA, USA

4. Computer Science Department, California Institute of Technology, Pasadena, CA, USA

Abstract

Manipulating volumetric deformable objects in the real world, like plush toys and pizza dough, brings substantial challenges due to infinite shape variations, non-rigid motions, and partial observability. We introduce ACID, an action-conditional visual dynamics model for volumetric deformable objects based on structured implicit neural representations. ACID integrates two new techniques: implicit representations for action-conditional dynamics and geodesics-based contrastive learning. To represent deformable dynamics from partial RGB-D observations, we learn implicit representations of occupancy and flow-based forward dynamics. To accurately identify state change under large non-rigid deformations, we learn a correspondence embedding field through a novel geodesics-based contrastive loss. To evaluate our approach, we develop a simulation framework for manipulating complex deformable shapes in realistic scenes and a benchmark containing over 17,000 action trajectories with six types of plush toys and 78 variants. Our model achieves the best performance in geometry, correspondence, and dynamics predictions over existing approaches. The ACID dynamics models are successfully employed for goal-conditioned deformable manipulation tasks, resulting in a 30% increase in task success rate over the strongest baseline. Furthermore, we apply the simulation-trained ACID model directly to real-world objects and show success in manipulating them into target configurations. https://b0ku1.github.io/acid/

Funder

Division of Information and Intelligent Systems

Publisher

SAGE Publications

Subject

Applied Mathematics,Artificial Intelligence,Electrical and Electronic Engineering,Mechanical Engineering,Modeling and Simulation,Software

Link

http://journals.sagepub.com/doi/pdf/10.1177/02783649231191222

Reference97 articles.

1. Efficient Geometry-aware 3D Generative Adversarial Networks

2. Chang AX, Funkhouser T, Guibas L, et al. (2015). Shapenet: an information-rich 3d model repository. arXiv preprint arXiv:1512.03012.