A Multi-Stage Deep Reinforcement Learning with Search-Based Optimization for Air–Ground Unmanned System Navigation-Reference-Cited by-同舟云学术

A Multi-Stage Deep Reinforcement Learning with Search-Based Optimization for Air–Ground Unmanned System Navigation

Published:2023-02-09 Issue:4 Volume:13 Page:2244
ISSN:2076-3417
Container-title:Applied Sciences
language:en
Short-container-title:Applied Sciences

Author:

Chen Xiaohui¹,Qi Yuhua¹,Yin Yizhen¹,Chen Yidong²,Liu Li²,Chen Hongbo¹

Affiliation:

1. School of System Science and Engineering, Sun Yat-Sen University, Guangzhou 510006, China

2. China Academy of Launch Vehicle Technology, Beijing 100076, China

Abstract

An important challenge for air–ground unmanned systems achieving autonomy is navigation, which is essential for them to accomplish various tasks in unknown environments. This paper proposes an end-to-end framework for solving air–ground unmanned system navigation using deep reinforcement learning (DRL) while optimizing by using a priori information from search-based path planning methods, which we call search-based optimizing DRL (SO-DRL) for the air–ground unmanned system. SO-DRL enables agents, i.e., an unmanned aerial vehicle (UAV) or an unmanned ground vehicle (UGV) to move to a given target in a completely unknown environment using only Lidar, without additional mapping or global planning. Our framework is equipped with Deep Deterministic Policy Gradient (DDPG), an actor–critic-based reinforcement learning algorithm, to input the agents’ state and laser scan measurements into the network and map them to continuous motion control. SO-DRL draws on current excellent search-based algorithms to demonstrate path planning and calculate rewards for its behavior. The demonstrated strategies are replayed in an experienced pool along with the autonomously trained strategies according to their priority. We use a multi-stage training approach based on course learning to train SO-DRL on the 3D simulator Gazebo and verify the robustness and success of the algorithm using new test environments for path planning in unknown environments. The experimental results show that SO-DRL can achieve faster algorithm convergence and a higher success rate. We piggybacked SO-DRL directly onto a real air–ground unmanned system, and SO-DRL can guide a UAV or UGV for navigation without adjusting any networks.

Funder

Research on Path Planning Algorithm of Swarm Unmanned System Based on Deep Reinforcement Learning of China University Industry, Education and Research Innovation Fund

Publisher

MDPI AG

Subject

Fluid Flow and Transfer Processes,Computer Science Applications,Process Chemistry and Technology,General Engineering,Instrumentation,General Materials Science

Link

https://www.mdpi.com/2076-3417/13/4/2244/pdf

Reference46 articles.

1. Special issue on ontologies and standards for intelligent systems: Editorial;Olszewska;Knowl. Eng. Rev.,2022

2. Aircraft visual inspection: A systematic literature review;Yasuda;Comput. Ind.,2022

3. Intelligent spraying robot for building walls with mobility and perception;Wang;Autom. Constr.,2022

4. Szrek, J., Zimroz, R., Wodecki, J., Michalak, A., Góralczyk, M., and Worsa-Kozak, M. (2020). Application of the infrared thermography and unmanned ground vehicle for rescue action support in underground mine—The amicos project. Remote Sens., 13.

5. MUDE-based control of quadrotor for accurate attitude tracking;Qi;Control Eng. Pract.,2021

Cited by 8 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Unmanned Ground Vehicle Path Planning Based on Improved DRL Algorithm;Electronics;2024-06-25

2. Vision-based collaborative robots for exploration in uneven terrains;Mechatronics;2024-06

3. A Novel Hybrid Genetic and A-star Algorithm for UAV Path Optimization;2024 IEEE 1st Karachi Section Humanitarian Technology Conference (KHI-HTC);2024-01-08

4. Cooperative Landing on Mobile Platform for Multiple Unmanned Aerial Vehicles via Reinforcement Learning;Journal of Aerospace Engineering;2024-01

5. UAVs for Disaster Management - An Exploratory Review;Procedia Computer Science;2024