Optimal trajectory tracking for uncertain linear discrete‐time systems using time‐varying <i>Q</i>‐learning-Reference-Cited by-同舟云学术

Optimal trajectory tracking for uncertain linear discrete‐time systems using time‐varying Q‐learning

Published:2024-04-24 Issue:7 Volume:38 Page:2340-2368
ISSN:0890-6327
Container-title:International Journal of Adaptive Control and Signal Processing
language:en
Short-container-title:Adaptive Control & Signal

Author:

Geiger Maxwell¹^ORCID,Narayanan Vignesh²^ORCID,Jagannathan Sarangapani¹^ORCID

Affiliation:

1. Department of Electrical and Computer Engineering Missouri University of Science and Technology Rolla Missouri USA

2. Department of Computer Science and Engineering University of South Carolina Columbia South Carolina USA

Abstract

SummaryThis article introduces a novel optimal trajectory tracking control scheme designed for uncertain linear discrete‐time (DT) systems. In contrast to traditional tracking control methods, our approach removes the requirement for the reference trajectory to align with the generator dynamics of an autonomous dynamical system. Moreover, it does not demand the complete desired trajectory to be known in advance, whether through the generator model or any other means. Instead, our approach can dynamically incorporate segments (finite horizons) of reference trajectories and autonomously learn an optimal control policy to track them in real time. To achieve this, we address the tracking problem by learning a time‐varying ‐function through state feedback. This ‐function is then utilized to calculate the optimal feedback gain and explicitly time‐varying feedforward control input, all without the need for prior knowledge of the system dynamics or having the complete reference trajectory in advance. Additionally, we introduce an adaptive observer to extend the applicability of the tracking control scheme to situations where full state measurements are unavailable. We rigorously establish the closed‐loop stability of our optimal adaptive control approach, both with and without the adaptive observer, employing Lyapunov theory. Moreover, we characterize the optimality of the controller with respect to the finite horizon length of the known components of the desired trajectory. To further enhance the controller's adaptability and effectiveness in multitask environments, we employ the Efficient Lifelong Learning Algorithm, which leverages a shared knowledge base within the recursive least squares algorithm for multitask ‐learning. The efficacy of our approach is substantiated through a comprehensive set of simulation results by using a power system example.

Funder

Office of Naval Research

Publisher

Wiley

Link

https://onlinelibrary.wiley.com/doi/pdf/10.1002/acs.3807

Reference28 articles.

1. Optimal Control

2. Predefined-Time Adaptive Fuzzy Control for a Class of Nonlinear Systems With Output Hysteresis

3. Finite-Time Adaptive Neural Control for a Class of Nonlinear Systems With Asymmetric Time-Varying Full-State Constraints

4. Reinforcement learning and adaptive dynamic programming for feedback control