Safe adaptive output‐feedback optimal control of a class of linear systems

Published:2024-04-10 Issue:11 Volume:34 Page:7082-7095
ISSN:1049-8923
Container-title:International Journal of Robust and Nonlinear Control
language:en
Short-container-title:Intl J Robust & Nonlinear

Author:

Mahmud S M Nahid¹,Abudia Moad²,Nivison Scott A.³,Bell Zachary I.³,Kamalapurkar Rushikesh²

Affiliation:

1. School of Aeronautics and Astronautics Purdue University West Lafayette Indiana USA

2. School of Mechanical and Aerospace Engineering Oklahoma State University Stillwater Oklahoma USA

3. Air Force Research Laboratory Eglin AFB Florida USA

Abstract

The objective of this research is to enable safety‐critical systems to simultaneously learn and execute optimal control policies in a safe manner to achieve complex autonomy. Learning optimal policies via trial and error, that is, traditional reinforcement learning, is difficult to implement in safety‐critical systems, particularly when task restarts are unavailable. Safe model‐based reinforcement learning techniques based on a barrier transformation have recently been developed to address this problem. However, these methods rely on full‐state feedback, limiting their usability in a real‐world environment. In this work, an output‐feedback safe model‐based reinforcement learning technique based on a novel barrier‐aware dynamic state estimator has been designed to address this issue. The developed approach facilitates simultaneous learning and execution of safe control policies for safety‐critical linear systems. Simulation results indicate that barrier transformation is an effective approach to achieve online reinforcement learning in safety‐critical systems using output feedback.

Funder

Air Force Research Laboratory

National Science Foundation

Office of Naval Research

Publisher

Wiley

Link

https://onlinelibrary.wiley.com/doi/pdf/10.1002/rnc.7334
