Online Adaptive Dynamic Programming-Based Solution of Networked Multiple-Pursuer and Single-Evader Game-Reference-Cited by-同舟云学术

Online Adaptive Dynamic Programming-Based Solution of Networked Multiple-Pursuer and Single-Evader Game

Published:2022-11-02 Issue:21 Volume:11 Page:3583
ISSN:2079-9292
Container-title:Electronics
language:en
Short-container-title:Electronics

Author:

Gong Zifeng,He Bing,Hu Chen,Zhang Xiaobo,Kang Weijie

Abstract

This paper presents a new scheme for the online solution of a networked multi-agent pursuit–evasion game based on an online adaptive dynamic programming method. As a multi-agent in the game can form an Internet of Things (IoT) system, by incorporating the relative distance and the control energy as the performance index, the expression of the policies when the agents reach the Nash equilibrium is obtained and proved by the minmax principle. By constructing a Lyapunov function, the capture conditions of the game are obtained and discussed. In order to enable each agent to obtain the policy for reaching the Nash equilibrium in real time, the online adaptive dynamic programming method is used to solve the game problem. Furthermore, the parameters of the neural network are fitted by value function approximation, which avoids the difficulties of solving the Hamilton-Jacobi–Isaacs equation, and the numerical solution of the Nash equilibrium is obtained. Simulation results depict the feasibility of the proposed method for use on multi-agent pursuit–evasion games.

Publisher

MDPI AG

Subject

Electrical and Electronic Engineering,Computer Networks and Communications,Hardware and Architecture,Signal Processing,Control and Systems Engineering

Link

https://www.mdpi.com/2079-9292/11/21/3583/pdf

Reference31 articles.

1. Robust multi-agent differential games with application to cooperative guidance;Liu;Aerosp. Sci. Technol.,2021

2. Optimal evading strategies for two-pursuer/one-evader problems;Makkapati;J. Guid. Control Dyn.,2018

3. Zhang, H., Ren, H., Mu, Y., and Han, J. Optimal consensus control design for multiagent systems with multiple time delay using adaptive dynamic programming. IEEE Trans. Cybern., 2021.

4. Yuan, Z., Wu, T., Wang, Q., Yang, Y., Li, L., and Zhang, L. T3omvp: A transformer-based time and team reinforcement learning scheme for observation-constrained multi-vehicle pursuit in urban area. Electronics, 2022. 11.

5. Satellite proximate pursuit-evasion game with different thrust configurations;Shi;Aerosp. Sci. Technol.,2020

Cited by 6 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Adaptive dynamic programming for containment control with robustness analysis to iterative error: A global Nash equilibrium solution;ISA Transactions;2024-08

2. Cooperative control for multi-player pursuit-evasion games embedded on communication technology with reinforcement learning;2023-11-07

3. Intelligent Escape of Robotic Systems: A Survey of Methodologies, Applications, and Challenges;Journal of Intelligent & Robotic Systems;2023-10-31

4. Nonlinear Multi-Object Differential Game Simulation Model in LabVIEW;Electronics;2023-09-11

5. Theoretic Solution of Multi-Player Pursuit-Evasion Game Based on Adaptive Dynamic Programming;2023 42nd Chinese Control Conference (CCC);2023-07-24