Improved Q-Learning Method for Linear Discrete-Time Systems-Reference-Cited by-同舟云学术

Improved Q-Learning Method for Linear Discrete-Time Systems

Published:2020-03-22 Issue:3 Volume:8 Page:368
ISSN:2227-9717
Container-title:Processes
language:en
Short-container-title:Processes

Author:

Chen Jian,Wang Jinhua,Huang Jie

Abstract

In this paper, the Q-learning method for quadratic optimal control problem of discrete-time linear systems is reconsidered. The theoretical results prove that the quadratic optimal controller cannot be solved directly due to the linear correlation of the data sets. The following corollaries have been made: (1) The correlation of data is the key factor in the success for the calculation of quadratic optimal control laws by Q-learning method; (2) The control laws for linear systems cannot be derived directly by the existing Q-learning method; (3) For nonlinear systems, there are some doubts about the data independence of current method. Therefore, it is necessary to discuss the probability of the controllers established by the existing Q-learning method. To solve this problem, based on the ridge regression, an improved model-free Q-learning quadratic optimal control method for discrete-time linear systems is proposed in this paper. Therefore, the computation process can be implemented correctly, and the effective controller can be solved. The simulation results show that the proposed method can not only overcome the problem caused by the data correlation, but also derive proper control laws for discrete-time linear systems.

Funder

National Natural Science Foundation of China

Publisher

MDPI AG

Subject

Process Chemistry and Technology,Chemical Engineering (miscellaneous),Bioengineering

Link

https://www.mdpi.com/2227-9717/8/3/368/pdf

Reference38 articles.

1. Reinforcement Learning and Feedback Control: Using Natural Decision Methods to Design Optimal Adaptive Controllers;Lewis;Control Syst. IEEE,2012

2. On adaptive control processes

3. Discrete time multivariable adaptive control;Ramadge;IEEE Trans. Autom. Control,1980

4. Q-learning

5. Spectral–Spatial Hyperspectral Image Classification Based on KNN

Cited by 1 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Special Issue “Active Flow Control Processes with Machine Learning and the Internet of Things”;Processes;2023-04-28