Graph MADDPG with RNN for multiagent cooperative environment-Reference-Cited by-同舟云学术

Graph MADDPG with RNN for multiagent cooperative environment

Published:2023-06-29 Issue: Volume:17 Page:
ISSN:1662-5218
Container-title:Frontiers in Neurorobotics
language:
Short-container-title:Front. Neurorobot.

Author:

Wei Xiaolong,Cui WenPeng,Huang Xianglin,Yang LiFang,Tao Zhulin,Wang Bing

Abstract

Multiagent systems face numerous challenges due to environmental uncertainty, with scalability being a critical issue. To address this, we propose a novel multi-agent cooperative model based on a graph attention network. Our approach considers the relationship between agents and continuous action spaces, utilizing graph convolution and recurrent neural networks to define these relationships. Graph convolution is used to define the relationship between agents, while recurrent neural networks define continuous action spaces. We optimize and model the multiagent system by encoding the interaction weights among agents using the graph neural network and the weights between continuous action spaces using the recurrent neural network. We evaluate the performance of our proposed model by conducting experimental simulations using a 3D wargame engine that involves several unmanned air vehicles (UAVs) acting as attackers and radar stations acting as defenders, where both sides have the ability to detect each other. The results demonstrate that our proposed model outperforms the current state-of-the-art methods in terms of scalability, robustness, and learning efficiency.

Publisher

Frontiers Media SA

Subject

Artificial Intelligence,Biomedical Engineering

Reference47 articles.

1. Neural machine translation by jointly learning to align and translate BahdanauD. ChoK. BengioY. arXiv [Preprint]2014

2. “End-to-end object detection with transformers,”;Carion;ECCV,2020

3. Towards hybrid gait obstacle avoidance for a six wheel-legged robot with payload transportation;Chen;J. Intell. Robot. Syst,2021

4. Flexible gait transition for six wheel-legged robot with unstructured terrains;Chen;Robot. Auton. Syst,2022

5. Bert: Pre-training of deep bidirectional transformers for language understanding DevlinJ. ChangM.-W. LeeK. ToutanovaK. arXiv [Preprint]2018

Cited by 1 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Hierarchical RNNs with graph policy and attention for drone swarm;Journal of Computational Design and Engineering;2024-03-06