Author:
Ren Jing,Huang Xishi,Huang Raymond N.
Abstract
In this paper, we propose a novel deep reinforcement learning (DRL) method for optimal path planning for mobile robots using dynamic programming (DP)-based data collection. The proposed method can overcome the slow learning process and improve training data quality inherently in DRL algorithms. The main idea of our approach is as follows. First, we mapped the dynamic programming method to typical optimal path planning problems for mobile robots, and created a new efficient DP-based method to find an exact, analytical, optimal solution for the path planning problem. Then, we used high-quality training data gathered using the DP method for DRL, which greatly improves training data quality and learning efficiency. Next, we established a two-stage reinforcement learning method where, prior to the DRL, we employed extreme learning machines (ELM) to initialize the parameters of actor and critic neural networks to a near-optimal solution in order to significantly improve the learning performance. Finally, we illustrated our method using some typical path planning tasks. The experimental results show that our DRL method can converge much easier and faster than other methods. The resulting action neural network is able to successfully guide robots from any start position in the environment to the goal position while following the optimal path and avoiding collision with obstacles.
Funder
Natural Sciences and Engineering Research Council of Canada
Subject
Electrical and Electronic Engineering,Computer Networks and Communications,Hardware and Architecture,Signal Processing,Control and Systems Engineering
Reference44 articles.
1. Joint Optimization of Multi-UAV Target Assignment and Path Planning Based on Multi-Agent Reinforcement Learning;Qie;IEEE Access,2019
2. Autonomous Navigation of UAVs in Large-Scale Complex Environments: A Deep Reinforcement Learning Approach;Wang;IEEE Trans. Veh. Technol.,2019
3. Deep reinforcement-learning-based driving policy for autonomous road vehicles;Makantasis;IET Intell. Transp. Syst.,2020
4. Garg, A., Chiang, H.-T.L., Sugaya, S., Faust, A., and Tapia, L. Comparison of Deep Reinforcement Learning Policies to Formal Method for Moving Obstacle Avoidance. Proceedings of the 2019 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS).
5. Tung, T.X., and Ngo, D. Socially Aware Robot Navigation Using Deep Reinforcement Learning. Proceedings of the 2018 IEEE Canadian Conference on Electrical & Computer Engineering.
Cited by
6 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献