Safe Model-Based Reinforcement Learning for Systems With Parametric Uncertainties-Reference-Cited by-同舟云学术

Safe Model-Based Reinforcement Learning for Systems With Parametric Uncertainties

Published:2021-12-16 Issue: Volume:8 Page:
ISSN:2296-9144
Container-title:Frontiers in Robotics and AI
language:
Short-container-title:Front. Robot. AI

Author:

Mahmud S. M. Nahid,Nivison Scott A.,Bell Zachary I.,Kamalapurkar Rushikesh

Abstract

Reinforcement learning has been established over the past decade as an effective tool to find optimal control policies for dynamical systems, with recent focus on approaches that guarantee safety during the learning and/or execution phases. In general, safety guarantees are critical in reinforcement learning when the system is safety-critical and/or task restarts are not practically feasible. In optimal control theory, safety requirements are often expressed in terms of state and/or control constraints. In recent years, reinforcement learning approaches that rely on persistent excitation have been combined with a barrier transformation to learn the optimal control policies under state constraints. To soften the excitation requirements, model-based reinforcement learning methods that rely on exact model knowledge have also been integrated with the barrier transformation framework. The objective of this paper is to develop safe reinforcement learning method for deterministic nonlinear systems, with parametric uncertainties in the model, to learn approximate constrained optimal policies without relying on stringent excitation conditions. To that end, a model-based reinforcement learning technique that utilizes a novel filtered concurrent learning method, along with a barrier transformation, is developed in this paper to realize simultaneous learning of unknown model parameters and approximate optimal state-constrained control policies for safety-critical systems.

Funder

Air Force Research Laboratory

Publisher

Frontiers Media SA

Subject

Artificial Intelligence,Computer Science Applications

Reference36 articles.

1. Finite-time Parameter Estimation in Adaptive Control of Nonlinear Systems;Adetola;IEEE Trans. Automat. Contr.,2008

2. Control Barrier Function Based Quadratic Programs for Safety Critical Systems;Ames;IEEE Trans. Automat. Contr.,2017

3. Adaptive Control with Guaranteed Transient and Steady State Tracking Error Bounds for Strict Feedback Systems;Bechlioulis;Automatica,2009

4. An Actor-Critic-Identifier Architecture for Adaptive Approximate Optimal Control;Bhasin,2012

Cited by 10 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Reinforcement learning control for USVs using prescribed performance sliding surfaces and an event-triggered strategy;Ocean Engineering;2024-08

2. Neuromechanical Model-free Epistemic Risk Guided Exploration (NeuroMERGE) for Safe Autonomy in Human-Robot Interaction;2024 American Control Conference (ACC);2024-07-10

3. Situational awareness and state estimation tools for search and localize missions with stationary targets in formation;Automatic Target Recognition XXXIV;2024-06-07

4. Neuro-Dynamic Control of an Above Knee Prosthetic Leg;SN Computer Science;2024-05-02

5. Safe adaptive output‐feedback optimal control of a class of linear systems;International Journal of Robust and Nonlinear Control;2024-04-10