Multi-UAV Autonomous Path Planning in Reconnaissance Missions Considering Incomplete Information: A Reinforcement Learning Method

Published:2022-12-23 Issue:1 Volume:7 Page:10
ISSN:2504-446X
Container-title:Drones
language:en
Short-container-title:Drones

Author:

Chen Yu, Dong Qi, Shang Xiaozhou, Wu Zhenyu, Wang Jinyu

Abstract

Unmanned aerial vehicles (UAVs) are important in reconnaissance missions because of their flexibility and convenience. Crucially, UAVs are capable of autonomous navigation, which means they can plan safe paths to target positions in dangerous surroundings. Traditional path-planning algorithms do not perform well when the environmental state is dynamic and only partially observable, because it is difficult for a UAV to make the correct decision with incomplete information. In this study, we propose a multi-UAV path-planning algorithm based on multi-agent reinforcement learning that adopts a centralized training–decentralized execution architecture to coordinate all the UAVs. Additionally, we introduce the hidden state of a recurrent neural network to make use of historical observation information. To solve the multi-objective optimization problem, we design a joint reward function that guides the UAVs to learn optimal policies under multiple constraints. The results demonstrate that our method addresses the incomplete information and low efficiency caused by partial observations and sparse rewards in reinforcement learning, and realizes multi-UAV cooperative autonomous path planning in an unknown environment.

Funder

Open Fund of Anhui Province Key Laboratory of Cyberspace Security Situation Awareness and Evaluation

Publisher

MDPI AG

Subject

Artificial Intelligence,Computer Science Applications,Aerospace Engineering,Information Systems,Control and Systems Engineering

Link

https://www.mdpi.com/2504-446X/7/1/10/pdf

References: 31 articles.

1. A survey of single and multi-UAV aerial manipulation;Mohiuddin;Unmanned Syst.,2020

2. Stern, R. (2019). Artificial Intelligence, Springer.

3. Ma, H., Wagner, G., Felner, A., Li, J., Kumar, T., and Koenig, S. (2018). Multi-agent path finding with deadlines. arXiv.

4. Ngatchou, P., Zarei, A., and El-Sharkawi, A. (2005, January 6–10). Pareto multi objective optimization. Proceedings of the 13th International Conference on Intelligent Systems Application to Power Systems, Arlington, VA, USA.

5. Multi-objective optimization using genetic algorithms: A tutorial;Konak;Reliab. Eng. Syst. Saf.,2006

Cited by 24 articles.

1. A novel multi-objective dung beetle optimizer for Multi-UAV cooperative path planning;Heliyon;2024-09

2. Holistic Review of UAV-Centric Situational Awareness: Applications, Limitations, and Algorithmic Challenges;Robotics;2024-07-29

3. Simulation Training System for Parafoil Motion Controller Based on Actor–Critic RL Approach;Actuators;2024-07-25

4. A Deep Reinforcement Learning Algorithm for Trajectory Planning of Swarm UAV Fulfilling Wildfire Reconnaissance;Electronics;2024-06-30

5. UAV flight path planning optimization;Telecommunication Systems;2024-06-27