Non‐zero‐sum games of discrete‐time Markov jump systems with unknown dynamics: An off‐policy reinforcement learning method-Reference-Cited by-同舟云学术

Non‐zero‐sum games of discrete‐time Markov jump systems with unknown dynamics: An off‐policy reinforcement learning method

Published:2023-09-28 Issue:2 Volume:34 Page:949-968
ISSN:1049-8923
Container-title:International Journal of Robust and Nonlinear Control
language:en
Short-container-title:Intl J Robust & Nonlinear

Author:

Zhang Xuewen¹,Shen Hao¹^ORCID,Li Feng¹^ORCID,Wang Jing¹

Affiliation:

1. School of Electrical and Information Engineering Anhui University of Technology Ma'anshan China

Abstract

AbstractThis article concentrates on the non‐zero‐sum games problem of discrete‐time Markov jump systems without requiring the system dynamics information. First, the multiplayer non‐zero‐sum games problem can be converted to solve a set of coupled game algebraic Riccati equations, which is difficult to be solved directly. Then, to obtain the optimal control policies, a model‐based algorithm adapting the policy iteration approach is proposed. However, the model‐based algorithm relies on system dynamics information, which has the limitations in practice. Subsequently, an off‐policy reinforcement learning algorithm is given to get rid of the dependence on system dynamics information, which only uses the information of system states and inputs. Moreover, the proof of convergence and Nash equilibrium are also given. Finally, a numerical example is given to demonstrate the effectiveness of the proposed algorithms.

Funder

National Natural Science Foundation of China

Natural Science Foundation of Anhui Province

Publisher

Wiley

Subject

Electrical and Electronic Engineering,Industrial and Manufacturing Engineering,Mechanical Engineering,Aerospace Engineering,Biomedical Engineering,General Chemical Engineering,Control and Systems Engineering

Link

https://onlinelibrary.wiley.com/doi/pdf/10.1002/rnc.7021

Reference48 articles.

1. Passivity-Based Asynchronous Control for Markov Jump Systems

2. Stability and stabilization of discrete-time singular Markov jump systems with time-varying delay

3. Discrete-Time Markov Jump Linear Systems

4. Stability of discrete-time linear systems with Markovian jumping parameters and constrained control

Cited by 3 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Initial Excitation-Based Optimal Control for Continuous-Time Linear Nonzero-Sum Games;IEEE Transactions on Systems, Man, and Cybernetics: Systems;2024-09

2. Anti‐modal‐asynchrony sliding mode control for semi‐Markov jump systems;International Journal of Robust and Nonlinear Control;2024-06-12

3. Multi‐event‐triggered adaptive dynamic programming for non‐zero‐sum game of unknown nonlinear system;International Journal of Robust and Nonlinear Control;2024-02-13