Efficient Path Planning for Mobile Robot Based on Deep Deterministic Policy Gradient

Author:

Gong Hui,Wang PengORCID,Ni Cui,Cheng Nuo

Abstract

When a traditional Deep Deterministic Policy Gradient (DDPG) algorithm is used in mobile robot path planning, due to the limited observable environment of mobile robots, the training efficiency of the path planning model is low, and the convergence speed is slow. In this paper, Long Short-Term Memory (LSTM) is introduced into the DDPG network, the former and current states of the mobile robot are combined to determine the actions of the robot, and a Batch Norm layer is added after each layer of the Actor network. At the same time, the reward function is optimized to guide the mobile robot to move faster towards the target point. In order to improve the learning efficiency, different normalization methods are used to normalize the distance and angle between the mobile robot and the target point, which are used as the input of the DDPG network model. When the model outputs the next action of the mobile robot, mixed noise composed of Gaussian noise and Ornstein–Uhlenbeck (OU) noise is added. Finally, the simulation environment built by a ROS system and a Gazebo platform is used for experiments. The results show that the proposed algorithm can accelerate the convergence speed of DDPG, improve the generalization ability of the path planning model and improve the efficiency and success rate of mobile robot path planning.

Funder

China Postdoctoral Science Foundation

Science and Technology Project of Shandong Provincial Department of Transportation

Publisher

MDPI AG

Subject

Electrical and Electronic Engineering,Biochemistry,Instrumentation,Atomic and Molecular Physics, and Optics,Analytical Chemistry

Cited by 24 articles. 订阅此论文施引文献 订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献

1. Deep Reinforcement Learning Based Mobile Robot Navigation in Crowd Environments;2024 21st International Conference on Ubiquitous Robots (UR);2024-06-24

2. Research on mobile robot path planning in complex environment based on DRQN algorithm;Physica Scripta;2024-06-14

3. Towards Developing a Framework for Autonomous Electric Vehicles Using CARLA: A Validation Using the Deep Deterministic Policy Gradient Algorithm;2024 32nd Mediterranean Conference on Control and Automation (MED);2024-06-11

4. Mapless Navigation for Mobile Robots Based on Improved Soft Actor-Critic Algorithm;2024 39th Youth Academic Annual Conference of Chinese Association of Automation (YAC);2024-06-07

5. Robot path planning algorithm with improved DDPG algorithm;International Journal on Interactive Design and Manufacturing (IJIDeM);2024-05-29

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3