Homotopic policy iteration-based learning design for unknown linear continuous-time systems-Reference-Cited by-同舟云学术

Homotopic policy iteration-based learning design for unknown linear continuous-time systems

Published:2022-04 Issue: Volume:138 Page:110153
ISSN:0005-1098
Container-title:Automatica
language:en
Short-container-title:Automatica

Author:

Chen Ci,Lewis Frank L.,Li Bo

Publisher

Elsevier BV

Subject

Electrical and Electronic Engineering,Control and Systems Engineering

Reference44 articles.

1. Abbasi-Yadkori, Y., & Szepesvári, C. (2011). Regret bounds for the adaptive control of linear quadratic systems. In Proceedings of the 24th annual conference on learning theory, JMLR workshop and conference proceedings, (pp. 1–26).

2. Real-time optimization by extremum-seeking control;Ariyur,2003

3. Adaptive control;Åström,2013

4. Dynamic programming and optimal control, vol. 1;Bertsekas,1995

5. Value iteration and adaptive dynamic programming for data-driven adaptive optimal control design;Bian;Automatica,2016

Cited by 17 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Novel single-loop policy iteration for linear zero-sum games;Automatica;2024-05

2. Specified convergence rate guaranteed output tracking of discrete-time systems via reinforcement learning;Automatica;2024-03

3. Learning Optimal Control Policy for Unknown Discrete-Time Systems;IEEE Transactions on Circuits and Systems II: Express Briefs;2023-11

4. Distributed Minmax Strategy for Consensus Tracking in Differential Graphical Games: A Model-Free Approach;IEEE Systems, Man, and Cybernetics Magazine;2023-10

5. Optimal control of unknown nonlinear system under event‐triggered mechanism and identifier‐critic‐actor architecture;International Journal of Robust and Nonlinear Control;2023-09-05