Research on mobile robot path planning in complex environment based on DRQN algorithm

Published: 2024-06-14 Issue: 7 Volume: 99 Page: 076012
ISSN: 0031-8949
Container-title: Physica Scripta
Short-container-title: Phys. Scr.

Author:

Wang Shuai, Du Yuhong, Lin Jingxuan, Zhao Shuaijie

Abstract

A deep reinforcement Q-learning algorithm (DRQN) based on a radial basis function (RBF) neural network is proposed to achieve path planning and obstacle avoidance for mobile robots in complex ground environments containing both static and dynamic obstacles. First, the path planning problem is formulated as a partially observable Markov decision process. Steering angle, running characteristics, and other elements are introduced into the state-action decision space, and the greedy factor is dynamically adjusted using a simulated annealing algorithm, which improves the mobile robot's environment exploration and action selection accuracy. Second, the Q-learning algorithm is improved by replacing the Q-table with an RBF neural network to strengthen the approximation of the value function; the parameters of the hidden layer and the weights between the hidden and output layers are trained using the dynamic clustering and least-mean methods, respectively, which improves convergence speed and the robot's ability to handle large-scale computation. Lastly, a double reward mechanism is set up to prevent the mobile robot from searching blindly in unknown environments, which enhances learning ability and improves the safety and flexibility of path planning. Simulation experiments are run on different types of scenarios, and the results verify the superiority of the DRQN algorithm. Taking the 30 × 30 complex scene as an example, path planning with the DRQN algorithm reduces distance, turning angle, and planning time by 27.04%, 7.76%, and 28.05%, respectively, relative to the average values of the Q-learning, optimized Q-learning, deep Q-learning, and DDPG algorithms, effectively improving path planning efficiency for mobile robots in complex environments.
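
The two core ideas in the abstract, an annealed greedy factor and an RBF network in place of a Q-table, can be illustrated with a minimal sketch. Everything below is an assumption made for illustration and is not the authors' implementation: the geometric cooling schedule in `annealed_epsilon`, the fixed Gaussian centers (the abstract says the hidden layer is trained by dynamic clustering), and the LMS-style temporal-difference weight update are all hypothetical choices, as are the names `RBFQ` and `select_action`.

```python
import math
import random


def annealed_epsilon(step, eps_start=1.0, eps_end=0.05, cooling=0.99):
    """Exploration rate under a geometric cooling schedule (assumed form)."""
    return eps_end + (eps_start - eps_end) * cooling ** step


class RBFQ:
    """Q(s, a) as a weighted sum of Gaussian radial basis functions."""

    def __init__(self, centers, width, n_actions):
        self.centers = centers  # fixed here; the paper fits them by clustering
        self.width = width
        # One weight row per action, one weight per hidden unit.
        self.weights = [[0.0] * len(centers) for _ in range(n_actions)]

    def features(self, state):
        """Hidden-layer activations: one Gaussian response per center."""
        denom = 2.0 * self.width ** 2
        return [
            math.exp(-sum((s - c) ** 2 for s, c in zip(state, center)) / denom)
            for center in self.centers
        ]

    def q_values(self, state):
        phi = self.features(state)
        return [sum(w * p for w, p in zip(row, phi)) for row in self.weights]

    def update(self, state, action, reward, next_state, done, lr=0.1, gamma=0.9):
        """One LMS-style step on the output weights; returns the TD error."""
        phi = self.features(state)
        q_sa = sum(w * p for w, p in zip(self.weights[action], phi))
        target = reward if done else reward + gamma * max(self.q_values(next_state))
        td_error = target - q_sa
        row = self.weights[action]
        for j, p in enumerate(phi):
            row[j] += lr * td_error * p
        return td_error


def select_action(model, state, step, rng=random):
    """Epsilon-greedy choice with the annealed exploration rate."""
    if rng.random() < annealed_epsilon(step):
        return rng.randrange(len(model.weights))
    q = model.q_values(state)
    return q.index(max(q))


# Usage: a 3 x 3 grid of centers over a small 2-D state space, 4 actions.
centers = [(float(x), float(y)) for x in range(3) for y in range(3)]
model = RBFQ(centers, width=1.0, n_actions=4)
first_error = model.update((0.0, 0.0), 1, 1.0, (0.0, 1.0), done=True)
for _ in range(50):
    last_error = model.update((0.0, 0.0), 1, 1.0, (0.0, 1.0), done=True)
action = select_action(model, (0.0, 0.0), step=10_000)
```

Because nearby states activate overlapping basis functions, an update at one state also shifts the value estimates of its neighbours, which is what lets the network generalise where a Q-table cannot. The double reward mechanism is not sketched here, since the abstract does not specify its terms.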

Funder

Tianjin Science and Technology Plan

Publisher

IOP Publishing

Link

https://iopscience.iop.org/article/10.1088/1402-4896/ad551b/pdf

References (33 articles)

1. Autonomous vehicle path planning for smart logistics mobile applications based on modified heuristic algorithm;Fusic;Meas. Sci. Technol.,2023

2. Reinforcement learning for mobile robotics exploration: a survey;Garaffa;IEEE Trans. Neural Netw. Learn. Syst.,2021

3. Region coverage-aware path planning for unmanned aerial vehicles: a systematic review;Kumar;Physical Communication,2023

4. The review unmanned surface vehicle path planning: based on multi-modality constraint;Zhou;Ocean Eng.,2020

5. An improved fault-tolerant cultural-PSO with probability for multi-AGV path planning;Lin;Expert Systems with Applications,2023