Affiliation:
1. School of Artificial Intelligence, Beijing University of Posts and Telecommunications, Beijing, China
2. School of Engineering Technology, Purdue University, West Lafayette, Indiana, USA
Abstract
To reduce learning time and memory usage, this study presents a novel model-free algorithm for obtaining the Nash equilibrium solution of continuous-time nonlinear non-zero-sum games. Based on the integral reinforcement learning method, a new integral Hamilton–Jacobi (HJ) equation is proposed that quickly and cooperatively determines the Nash equilibrium strategies of all players. By leveraging neural network approximation and the gradient descent method, simultaneous continuous-time adaptive tuning laws are provided for both critic and actor neural network weights. These laws enable estimation of the optimal value function and optimal policy without requiring knowledge or identification of the system's dynamics. Closed-loop system stability and convergence of the weights are guaranteed through Lyapunov analysis. Additionally, the algorithm is enhanced to reduce the number of auxiliary neural networks used in the critic. Simulation results for a two-player non-zero-sum game validate the effectiveness of the proposed algorithm.
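The model-free idea summarized above can be illustrated on a toy problem. The sketch below is not the paper's algorithm; it is a minimal scalar two-player analogue under assumed quadratic value functions V_i(x) = w_i x² and assumed dynamics/cost parameters (a, b_i, q_i, r_i are all hypothetical). Each critic weight w_i is tuned by gradient descent on the integral (Bellman) residual over a short reinforcement interval, and the actor policies u_i = -(b_i/r_i) w_i x are updated simultaneously from the same weights, so no separate system identification step is needed:

```python
import numpy as np

# Hypothetical scalar two-player non-zero-sum game (not from the paper):
#   x' = a*x + b1*u1 + b2*u2,  cost_i = integral of q_i*x^2 + r_i*u_i^2
a, b1, b2 = -1.0, 1.0, 1.0
q1, q2, r1, r2 = 1.0, 1.0, 1.0, 2.0

# Quadratic value-function ansatz V_i(x) = w_i*x^2 gives the stationary
# policies u_i = -(b_i/r_i)*w_i*x.
w = np.array([0.0, 0.0])        # critic weights, tuned online
alpha, dt, T = 0.5, 1e-3, 0.05  # learning rate, Euler step, IRL interval

rng = np.random.default_rng(0)
for episode in range(200):
    x = rng.uniform(0.5, 2.0)          # restart state for exploration
    for _ in range(40):                # several IRL intervals per episode
        x0, rho = x, np.zeros(2)
        for _ in range(int(T / dt)):   # integrate one reinforcement interval
            u1 = -(b1 / r1) * w[0] * x
            u2 = -(b2 / r2) * w[1] * x
            rho += dt * np.array([q1 * x**2 + r1 * u1**2,
                                  q2 * x**2 + r2 * u2**2])
            x += dt * (a * x + b1 * u1 + b2 * u2)
        # Integral HJ residual: V_i(x0) - V_i(xT) should equal the accrued cost.
        phi = x0**2 - x**2             # regressor shared by both players
        e = w * phi - rho              # one Bellman residual per player
        w -= alpha * e * phi / (1.0 + phi**2)  # normalized gradient step

print(w)  # approaches the Nash solution of the coupled scalar HJ equations
```

At convergence the weights satisfy the coupled algebraic equations q_i + (b_i²/r_i) w_i² + 2 w_i (a - Σ_j (b_j²/r_j) w_j) = 0, the scalar counterpart of the coupled HJ equations; note that only sampled trajectory data, not the drift coefficient itself, enters the weight updates.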
Funder
National Natural Science Foundation of China
Publisher
Institution of Engineering and Technology (IET)
Subject
Electrical and Electronic Engineering, Control and Optimization, Computer Science Applications, Human-Computer Interaction, Control and Systems Engineering