DTC: Deep Tracking Control-Reference-Cited by-同舟云学术

DTC: Deep Tracking Control

Published:2024-01-17 Issue:86 Volume:9 Page:
ISSN:2470-9476
Container-title:Science Robotics
language:en
Short-container-title:Sci. Robot.

Author:

Jenelten Fabian¹^ORCID,He Junzhe¹^ORCID,Farshidian Farbod¹,Hutter Marco¹^ORCID

Affiliation:

1. Robotic Systems Lab, ETH Zurich, 8092 Zurich, Switzerland.

Abstract

Legged locomotion is a complex control problem that requires both accuracy and robustness to cope with real-world challenges. Legged systems have traditionally been controlled using trajectory optimization with inverse dynamics. Such hierarchical model-based methods are appealing because of intuitive cost function tuning, accurate planning, generalization, and, most importantly, the insightful understanding gained from more than one decade of extensive research. However, model mismatch and violation of assumptions are common sources of faulty operation. Simulation-based reinforcement learning, on the other hand, results in locomotion policies with unprecedented robustness and recovery skills. Yet, all learning algorithms struggle with sparse rewards emerging from environments where valid footholds are rare, such as gaps or stepping stones. In this work, we propose a hybrid control architecture that combines the advantages of both worlds to simultaneously achieve greater robustness, foot-placement accuracy, and terrain generalization. Our approach uses a model-based planner to roll out a reference motion during training. A deep neural network policy is trained in simulation, aiming to track the optimized footholds. We evaluated the accuracy of our locomotion pipeline on sparse terrains, where pure data-driven methods are prone to fail. Furthermore, we demonstrate superior robustness in the presence of slippery or deformable ground when compared with model-based counterparts. Last, we show that our proposed tracking controller generalizes across different trajectory optimization methods not seen during training. In conclusion, our work unites the predictive capabilities and optimality guarantees of online planning with the inherent robustness attributed to offline learning.

Publisher

American Association for the Advancement of Science (AAAS)

Subject

Artificial Intelligence,Control and Optimization,Computer Science Applications,Mechanical Engineering

Link

https://www.science.org/doi/pdf/10.1126/scirobotics.adh5401

Reference38 articles.

1. J. Z. Kolter M. P. Rodgers A. Y. Ng A control architecture for quadruped locomotion over rough terrain in 2008 IEEE International Conference on Robotics and Automation (IEEE 2008) pp. 811–818.

2. M. Kalakrishnan J. Buchli P. Pastor M. Mistry S. Schaal Fast robust quadruped locomotion over challenging terrain in 2010 IEEE International Conference on Robotics and Automation (IEEE 2010) pp. 2665–2670.

3. Gait and Trajectory Optimization for Legged Systems Through Phase-Based End-Effector Parameterization

4. Motion Planning for Quadrupedal Locomotion: Coupled Planning, Terrain Mapping, and Whole-Body Control

5. Perceptive Locomotion in Rough Terrain – Online Foothold Optimization