Autonomous and cooperative control of UAV cluster with multi-agent reinforcement learning-Reference-Cited by-同舟云学术

Autonomous and cooperative control of UAV cluster with multi-agent reinforcement learning

Published:2022-01-13 Issue:1300 Volume:126 Page:932-951
ISSN:0001-9240
Container-title:The Aeronautical Journal
language:en
Short-container-title:Aeronaut. j.

Author:

Xu D.^ORCID,Chen G.

Abstract

AbstractIn this paper, we expolore Multi-Agent Reinforcement Learning (MARL) methods for unmanned aerial vehicle (UAV) cluster. Considering that the current UAV cluster is still in the program control stage, the fully autonomous and intelligent cooperative combat has not been realised. In order to realise the autonomous planning of the UAV cluster according to the changing environment and cooperate with each other to complete the combat goal, we propose a new MARL framework. It adopts the policy of centralised training with decentralised execution, and uses Actor-Critic network to select the execution action and then to make the corresponding evaluation. The new algorithm makes three key improvements on the basis of Multi-Agent Deep Deterministic Policy Gradient (MADDPG) algorithm. The first is to improve learning framework; it makes the calculated Q value more accurate. The second is to add collision avoidance setting, which can increase the operational safety factor. And the third is to adjust reward mechanism; it can effectively improve the cluster’s cooperative ability. Then the improved MADDPG algorithm is tested by performing two conventional combat missions. The simulation results show that the learning efficiency is obviously improved, and the operational safety factor is further increased compared with the previous algorithm.

Publisher

Cambridge University Press (CUP)

Subject

Aerospace Engineering

Reference52 articles.

1. Continuous control with deep reinforcement learning;Lillicrap;International Conference on Learning Representations,2015

2. Dynamic target tracking and observing in a mobile sensor network

3. Unmanned Aircraft Systems Airspace Integration: A Game Theoretical Framework for Concept Evaluations

4. Autonomous navigation of UAV by using real-time model-based reinforcement learning

Cited by 19 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Optimization of latching control for duck wave energy converter based on deep reinforcement learning;Ocean Engineering;2024-10

2. Advancement Challenges in UAV Swarm Formation Control: A Comprehensive Review;Drones;2024-07-12

3. Ontology-Oriented Multy-Agent System for Decentralized Control of UAV's Group;Kibernetika i vyčislitelʹnaâ tehnika;2024-06-26

4. A Review of Collaborative Trajectory Planning for Multiple Unmanned Aerial Vehicles;Processes;2024-06-20

5. Leader–follower UAVs formation control based on a deep Q-network collaborative framework;Scientific Reports;2024-02-26