The Study of Crash-Tolerant, Multi-Agent Offensive and Defensive Games Using Deep Reinforcement Learning-Reference-Cited by-同舟云学术

The Study of Crash-Tolerant, Multi-Agent Offensive and Defensive Games Using Deep Reinforcement Learning

Published:2023-01-08 Issue:2 Volume:12 Page:327
ISSN:2079-9292
Container-title:Electronics
language:en
Short-container-title:Electronics

Author:

Li Xilun^ORCID,Li Zhan^ORCID,Zheng Xiaolong^ORCID,Yang Xuebo^ORCID,Yu Xinghu^ORCID

Abstract

In the multi-agent offensive and defensive game (ODG), each agent achieves its goal by cooperating or competing with other agents. The multi-agent deep reinforcement learning (MADRL) method is applied in similar scenarios to help agents make decisions. In various situations, the agents of both sides may crash due to collisions. However, the existing algorithms cannot deal with the situation where the number of agents reduces. Based on the multi-agent deep deterministic policy gradient (MADDPG) algorithm, we study a method to deal with a reduction in the number of agents in the training process without changing the structure of the neural network (NN), which is called the frozen agent method for the MADDPG (FA-MADDPG) algorithm. In addition, we design a distance–collision reward function to help agents learn strategies better. Through the experiments in four scenarios with different numbers of agents, it is verified that the algorithm we proposed can not only successfully deal with the problem of agent number reduction in the training stage but also show better performance and higher efficiency than the MADDPG algorithm in simulation.

Funder

National Natural Science Foundation of China

Publisher

MDPI AG

Subject

Electrical and Electronic Engineering,Computer Networks and Communications,Hardware and Architecture,Signal Processing,Control and Systems Engineering

Link

https://www.mdpi.com/2079-9292/12/2/327/pdf

Reference22 articles.

1. Multi-player pursuit–evasion games with one superior evader;Chen;Automatica,2016

2. Hamilton–Jacobi Formulation for Reach–Avoid Differential Games;Margellos;IEEE Trans. Autom. Control.,2011

3. Cooperative pursuit with Voronoi partitions;Zhou;Automatica,2016

4. Multiplayer reach-avoid games via pairwise outcomes;Chen;IEEE Trans. Autom. Control.,2016

5. Zou, B., and Peng, X. (2022). A Bilateral Cooperative Strategy for Swarm Escort under the Attack of Aggressive Swarms. Electronics, 11.

Cited by 4 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Online Surveillance of IoT Agents in Smart Cities Using Deep Reinforcement Learning;International Journal of Intelligent Information Technologies;2024-07-26

2. Bidirectional Long Short-Term Memory (Bi-LSTM) Hourly Energy Forecasting;E3S Web of Conferences;2024

3. Checkers Game Therapy to Improve the Mental Ability Of Alzheimer’s Patient using AI Virtual Assistant;2023 Second International Conference on Augmented Intelligence and Sustainable Systems (ICAISS);2023-08-23

4. Multi-Agent Task Planning Algorithm Based on Region Parameter Sharing;2023 IEEE International Conference on Mechatronics and Automation (ICMA);2023-08-06