Affiliation:
1. School of Aerospace Engineering Georgia Institute of Technology Atlanta Georgia USA
Abstract
SummaryIn this paper, we develop an online learning algorithm for solving the Bellman equation for affine in the control discrete‐time nonlinear uncertain dynamical systems. To ensure accelerated learning of our algorithm in generating optimal control policies, we use an actor‐critic structure predicated on higher‐order tuner laws. More specifically, we construct a Nesterov‐like architecture involving momentum‐based learning laws leading to an accelerated convergence of the optimal control policy. The proposed online learning‐based optimal control framework guarantees uniform ultimate boundedness of the closed‐loop system under the assumption that the system is persistently excited. Finally, two illustrative numerical examples are provided to demonstrate the efficacy of the proposed approach.
Funder
Air Force Office of Scientific Research
National Science Foundation of Sri Lanka
Alexander S. Onassis Public Benefit Foundation
Subject
Electrical and Electronic Engineering,Signal Processing,Control and Systems Engineering