Learning stabilizable nonlinear dynamics with contraction-based regularization-Reference-Cited by-同舟云学术

Learning stabilizable nonlinear dynamics with contraction-based regularization

Published:2020-08-30 Issue:10-11 Volume:40 Page:1123-1150
ISSN:0278-3649
Container-title:The International Journal of Robotics Research
language:en
Short-container-title:The International Journal of Robotics Research

Author:

Singh Sumeet¹^ORCID,Richards Spencer M¹^ORCID,Sindhwani Vikas²,Slotine Jean-Jacques E³,Pavone Marco¹

Affiliation:

1. Department of Aeronautics and Astronautics, Stanford University, Stanford, CA, USA

2. Google Brain Robotics, New York, NY, USA

3. Department of Mechanical Engineering, Massachusetts Institute of Technology, Cambridge, MA, USA

Abstract

We propose a novel framework for learning stabilizable nonlinear dynamical systems for continuous control tasks in robotics. The key contribution is a control-theoretic regularizer for dynamics fitting rooted in the notion of stabilizability, a constraint which guarantees the existence of robust tracking controllers for arbitrary open-loop trajectories generated with the learned system. Leveraging tools from contraction theory and statistical learning in reproducing kernel Hilbert spaces, we formulate stabilizable dynamics learning as a functional optimization with a convex objective and bi-convex functional constraints. Under a mild structural assumption and relaxation of the functional constraints to sampling-based constraints, we derive the optimal solution with a modified representer theorem. Finally, we utilize random matrix feature approximations to reduce the dimensionality of the search parameters and formulate an iterative convex optimization algorithm that jointly fits the dynamics functions and searches for a certificate of stabilizability. We validate the proposed algorithm in simulation for a planar quadrotor, and on a quadrotor hardware testbed emulating planar dynamics. We verify, both in simulation and on hardware, significantly improved trajectory generation and tracking performance with the control-theoretic regularized model over models learned using traditional regression techniques, especially when learning from small supervised datasets. The results support the conjecture that the use of stabilizability constraints as a form of regularization can help prune the hypothesis space in a manner that is tailored to the downstream task of trajectory generation and feedback control. This produces models that are not only dramatically better conditioned, but also data efficient.

Funder

King Abdulaziz City for Science and Technology

NASA Space Technology Research Grants Program

national science foundation

Publisher

SAGE Publications

Subject

Applied Mathematics,Artificial Intelligence,Electrical and Electronic Engineering,Mechanical Engineering,Modeling and Simulation,Software

Link

http://journals.sagepub.com/doi/pdf/10.1177/0278364920949931

Reference67 articles.

1. Learning quadrotor dynamics using neural network for flight control

2. Goal-driven dynamics learning via Bayesian optimization

Cited by 25 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. How Generalizable is My Behavior Cloning Policy? A Statistical Approach to Trustworthy Performance Evaluation;IEEE Robotics and Automation Letters;2024-10

2. Fusion dynamical systems with machine learning in imitation learning: A comprehensive overview;Information Fusion;2024-08

3. Structural Risk Minimization for Learning Nonlinear Dynamics;2024 American Control Conference (ACC);2024-07-10

4. Adaptive Neural Stochastic Control With Lipschitz Constant Optimization;IEEE Transactions on Circuits and Systems I: Regular Papers;2024-07

5. Learning of Hamiltonian Dynamics with Reproducing Kernel Hilbert Spaces;2024 European Control Conference (ECC);2024-06-25