Three-Dimensional Path Planning of UAVs in a Complex Dynamic Environment Based on Environment Exploration Twin Delayed Deep Deterministic Policy Gradient

Published:2023-07-05 Issue:7 Volume:15 Page:1371
ISSN:2073-8994
Container-title:Symmetry
language:en
Short-container-title:Symmetry

Author:

Zhang Danyang¹, Li Xiongwei¹, Ren Guoquan¹, Yao Jiangyi¹, Chen Kaiyan¹, Li Xi¹

Affiliation:

1. Shijiazhuang Campus, Army Engineering University, Shijiazhuang 050003, China

Abstract

Unmanned aerial vehicle (UAV) path planning is the problem of having a UAV automatically plan an optimal path to its destination in a given environment while avoiding collisions with obstacles along the way. To address 3D path planning for UAVs in a dynamic environment, a heuristic dynamic reward function is designed to guide the UAV. We propose the Environment Exploration Twin Delayed Deep Deterministic Policy Gradient (EE-TD3) algorithm, which adds a symmetrical 3D environment exploration coding mechanism to the TD3 algorithm. The EE-TD3 model can effectively avoid collisions, improve training efficiency, and converge faster. Finally, the performance of EE-TD3 and other deep reinforcement learning algorithms was tested in a simulation environment. The results show that EE-TD3 outperforms the other algorithms tested on the 3D UAV path planning problem.

Funder

National Natural Science Foundation of China

Publisher

MDPI AG

Subject

Physics and Astronomy (miscellaneous), General Mathematics, Chemistry (miscellaneous), Computer Science (miscellaneous)

Link

https://www.mdpi.com/2073-8994/15/7/1371/pdf

References: 33 articles.

1. Toward a fully autonomous UAV: Research platform for indoor and outdoor urban search and rescue;Tomic;IEEE Robot. Autom. Mag.,2012

2. Stevens, R., Sadjadi, F., Braegelmann, J., Cordes, A., and Nelson, R. (2008, January 16–20). Small unmanned aerial vehicle (UAV) real-time intelligence, surveillance and reconnaissance (ISR) using onboard pre-processing. Proceedings of the Automatic Target Recognition XVIII, Orlando, FL, USA.

3. An active disturbance rejection approach to leader-follower controlled formation;Asian J. Control,2014

4. Deep reinforcement learning-based content placement and trajectory design in urban cache-enabled UAV networks;Wu;Wirel. Commun. Mob. Comput.,2020

5. Path planning for solar-powered UAV in urban environment;Wu;Neurocomputing,2018