Multiagent Reinforcement Learning Based on Fusion-Multiactor-Attention-Critic for Multiple-Unmanned-Aerial-Vehicle Navigation Control

Published:2022-10-10 Issue:19 Volume:15 Page:7426
ISSN:1996-1073
Container-title:Energies
language:en
Short-container-title:Energies

Author:

Jeon Sangwoo, Lee Hoeun, Kaliappan Vishnu Kumar, Nguyen Tuan Anh, Jo Hyungeun, Cho Hyeonseo, Min Dugki

Abstract

The proliferation of unmanned aerial vehicles (UAVs) has spawned a variety of intelligent services in which efficient coordination plays a significant role in the effectiveness of cooperative execution. However, because UAVs have limited operational time and range, achieving highly efficient coordinated actions is difficult, particularly in unknown dynamic environments. This paper proposes a multiagent deep reinforcement learning (MADRL)-based fusion-multiactor-attention-critic (F-MAAC) model for energy-efficient cooperative navigation control of multiple UAVs. The proposed model builds on the multiactor-attention-critic (MAAC) model and adds two significant advances. The first is a sensor fusion layer, which enables the actor network to utilize all required sensor information effectively. The second is a layer that computes the dissimilarity weights of different agents, added to compensate for the information lost through the attention layer of the MAAC model. We use the UAV LDS (logistic delivery service) environment, created with the Unity engine, to train the proposed model and verify its energy efficiency. To validate energy efficiency, a feature that measures the total distance traveled by the UAVs is incorporated into the UAV LDS environment. To demonstrate its performance, the F-MAAC model is compared with several conventional reinforcement learning models in two use cases. First, we compare the F-MAAC model to the DDPG, MADDPG, and MAAC models based on the mean episode rewards over 20k episodes of training. The two top-performing models (F-MAAC and MAAC) are then retrained for 150k episodes. As measures of energy efficiency, our study determines the total number of deliveries completed within the same period and within the same distance traveled. According to our simulation results, the F-MAAC model outperforms the MAAC model, making 38% more deliveries in 3000 time steps and 30% more deliveries per 1000 m of distance traveled.
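
The distance-based efficiency figure quoted in the abstract can be read as a ratio of two rates. The following is a minimal sketch of that arithmetic, not the authors' evaluation code: the function names `deliveries_per_1000m` and `relative_gain` are hypothetical, and the delivery counts and distances are made-up illustrative inputs rather than results from the paper.

```python
def deliveries_per_1000m(deliveries: int, distance_m: float) -> float:
    """Deliveries completed per 1000 m of total distance traveled."""
    if distance_m <= 0:
        raise ValueError("distance_m must be positive")
    return deliveries * 1000.0 / distance_m


def relative_gain(candidate: float, baseline: float) -> float:
    """Fractional improvement of candidate over baseline (0.30 means 30% more)."""
    if baseline <= 0:
        raise ValueError("baseline must be positive")
    return (candidate - baseline) / baseline


# Hypothetical logs (assumed values, NOT taken from the paper).
baseline_rate = deliveries_per_1000m(deliveries=20, distance_m=10_000.0)
candidate_rate = deliveries_per_1000m(deliveries=26, distance_m=10_000.0)

gain = relative_gain(candidate_rate, baseline_rate)
print(f"{gain:.0%} more deliveries per 1000 m")
```

The same `relative_gain` helper would apply to the time-based comparison by passing the delivery counts of each model over an equal number of time steps.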

Funder

National Research Foundation of Korea

Publisher

MDPI AG

Subject

Energy (miscellaneous); Energy Engineering and Power Technology; Renewable Energy, Sustainability and the Environment; Electrical and Electronic Engineering; Control and Optimization; Engineering (miscellaneous); Building and Construction

Link

https://www.mdpi.com/1996-1073/15/19/7426/pdf

References: 29 articles.

1. A proposal of methodology for multi-UAV mission modeling; Roldán; Proceedings of the 2015 23rd Mediterranean Conference on Control and Automation (MED), 2015

2. Comprehensive Energy Consumption Model for Unmanned Aerial Vehicles, Based on Empirical Studies of Battery Performance

3. Cooperative task assignment of multi-UAV system

4. Deep Reinforcement Learning for Multiagent Systems: A Review of Challenges, Solutions, and Applications

5. Multi-UAV Mobile Edge Computing and Path Planning Platform Based on Reinforcement Learning

Cited by 3 articles.

1. Actor-Hybrid-Attention-Critic for Multi-Logistic Robots Path Planning; IEEE Robotics and Automation Letters; 2024-06

2. Multi-Agent Deep Reinforcement Learning for Multi-Robot Applications: A Survey; Sensors; 2023-03-30

3. Robust Multi-Agent Reinforcement Learning Method Based on Adversarial Domain Randomization for Real-World Dual-UAV Cooperation; IEEE Transactions on Intelligent Vehicles; 2023