Affiliation:
1. Centre de Recherche en Automatique de Nancy (CRAN) UMR 7039, CNRS, Faculté des Sciences et Technologies, Université de Lorraine, Vandoeuvre Cedex, France
Abstract
Adaptive dynamic programming (ADP)–based approaches are effective for solving the nonlinear Hamilton–Jacobi–Bellman (HJB) equation in an approximate sense. This paper develops a novel ADP-based approach whose focus is on minimizing the consecutive changes in control inputs over a finite horizon, in order to solve the optimal tracking problem for completely unknown discrete-time systems. To that end, the cost function accounts for tracking performance, energy consumption and, as a novelty, consecutive changes in the control inputs. Through a suitable system transformation, the optimal tracking problem is converted into a regulation problem with respect to the state tracking error. This yields a novel finite-horizon performance index function and a corresponding nonlinear HJB equation, which is solved approximately and iteratively using a novel iterative ADP-based algorithm. A suitable neural-network-based structure is proposed to learn the initial admissible one-step zero control law. The proposed iterative ADP is implemented using the heuristic dynamic programming technique based on an actor–critic neural network structure. Finally, simulation studies are presented to illustrate the effectiveness of the proposed algorithm.
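The cost structure described above can be sketched as follows. This is an illustrative finite-horizon quadratic cost, not the paper's exact formulation: it penalizes the state tracking error, the control effort, and the consecutive change in control inputs. The weight matrices `Q`, `R`, `S` and the function name are hypothetical placeholders.

```python
import numpy as np

def finite_horizon_cost(errors, controls, Q, R, S, u_prev=None):
    """Illustrative cost: sum over k of
    e_k^T Q e_k + u_k^T R u_k + (u_k - u_{k-1})^T S (u_k - u_{k-1}),
    where e_k is the tracking error and (u_k - u_{k-1}) the control change."""
    J = 0.0
    # Assume zero initial control if no previous input is supplied.
    prev = np.zeros_like(controls[0]) if u_prev is None else u_prev
    for e, u in zip(errors, controls):
        du = u - prev            # consecutive change in the control input
        J += e @ Q @ e + u @ R @ u + du @ S @ du
        prev = u
    return J
```

Penalizing `du` smooths the control sequence, which is the feature the abstract highlights as distinguishing this cost from the standard tracking-plus-energy formulation.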
Subject
Applied Mathematics, Control and Optimization, Software, Control and Systems Engineering
Cited by
1 article.
1. Off‐policy model‐based end‐to‐end safe reinforcement learning;International Journal of Robust and Nonlinear Control;2023-11-20