Path Planning of Unmanned Aerial Vehicle in Complex Environments Based on State-Detection Twin Delayed Deep Deterministic Policy Gradient

Published:2023-01-13 Issue:1 Volume:11 Page:108
ISSN:2075-1702
Container-title:Machines
language:en
Short-container-title:Machines

Author:

Zhang Danyang, Xuan Zhaolong, Zhang Yang, Yao Jiangyi, Li Xi, Li Xiongwei

Abstract

This paper investigates path planning for an unmanned aerial vehicle (UAV) carrying out a raid mission by ultra-low-altitude flight in complex environments. During flight, the UAV must avoid radar detection areas, low-altitude static obstacles, and low-altitude dynamic obstacles. The uncertain movement of low-altitude dynamic obstacles can slow the convergence of existing algorithm models and reduce the UAV’s mission success rate. To solve this problem, the paper designs a state detection method that encodes the environmental state in the UAV’s direction of travel and compresses the environmental state space. Because the state space and action space are continuous, this method is combined with the twin delayed deep deterministic policy gradient (TD3) algorithm to form the SD-TD3 algorithm, which accelerates training convergence and improves the obstacle avoidance capability of the algorithm model. Further, to address the sparse reward problem of traditional reinforcement learning, a heuristic dynamic reward function is designed to give real-time rewards and guide the UAV to complete the task. Simulation results show that SD-TD3 training converges faster than TD3 training, and that the converged model performs better in practice.
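
For readers unfamiliar with TD3, the base algorithm named in the abstract, the following is a minimal sketch of how a TD3 learning target is typically computed for a single transition. It illustrates the general TD3 technique (target policy smoothing plus the minimum of two target critics), not the authors' SD-TD3 implementation. The function name `td3_target`, its parameters, the default hyperparameter values, and the scalar-action simplification are all assumptions made for illustration.

```python
import random


def td3_target(reward, next_state, done,
               actor_target, critic1_target, critic2_target,
               gamma=0.99, noise_std=0.2, noise_clip=0.5,
               max_action=1.0, rng=random):
    """Return the TD3 bootstrap target for one transition (scalar action).

    All names and default values here are illustrative assumptions,
    not taken from the SD-TD3 paper.
    """
    # Target policy smoothing: add clipped Gaussian noise to the target action.
    noise = rng.gauss(0.0, noise_std)
    noise = max(-noise_clip, min(noise_clip, noise))
    next_action = actor_target(next_state) + noise
    # Keep the perturbed action inside the valid action range.
    next_action = max(-max_action, min(max_action, next_action))

    # Clipped double-Q learning: use the smaller of the two target critics
    # to reduce overestimation of the action value.
    q_min = min(critic1_target(next_state, next_action),
                critic2_target(next_state, next_action))

    # No bootstrapping from terminal states.
    not_done = 0.0 if done else 1.0
    return reward + gamma * not_done * q_min


if __name__ == "__main__":
    # Toy stand-ins for the target networks (hypothetical).
    actor = lambda s: 0.5
    q1 = lambda s, a: 2.0
    q2 = lambda s, a: 3.0
    print(td3_target(1.0, next_state=None, done=False,
                     actor_target=actor, critic1_target=q1,
                     critic2_target=q2, gamma=0.5, noise_std=0.0))
```

In a full TD3 agent the two critics are regressed toward this target at every step, while the actor and the target networks are updated less frequently (the "delayed" part of the name).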

Funder

National Natural Science Foundation of China

Publisher

MDPI AG

Subject

Electrical and Electronic Engineering, Industrial and Manufacturing Engineering, Control and Optimization, Mechanical Engineering, Computer Science (miscellaneous), Control and Systems Engineering

Link

https://www.mdpi.com/2075-1702/11/1/108/pdf

References: 38 articles.

1. Path planning techniques for unmanned aerial vehicles: A review, solutions, and challenges;Aggarwal;Comput. Commun.,2020

2. Tsourdos, A., White, B., and Shanmugavel, M. (2010). Cooperative Path Planning of Unmanned Aerial Vehicles, John Wiley & Sons.

3. Three-dimensional unmanned aerial vehicle path planning using modified wolf pack search algorithm;Chen;Neurocomputing,2017

4. A review of cooperative path planning of an unmanned aerial vehicle group;Zhang;Front. Inf. Technol. Electron. Eng.,2020

5. Cabreira, T.M., Brisolara, L.B., and Paulo, R.F.J. (2019). Survey on coverage path planning with unmanned aerial vehicles. Drones, 3.

Cited by 1 article.

1. Robotic vision based obstacle avoidance for navigation of unmanned aerial vehicle using fuzzy rule based optimal deep learning model;Evolutionary Intelligence;2023-09-28