Path Planning Method for Manipulators Based on Improved Twin Delayed Deep Deterministic Policy Gradient and RRT*-Reference-Cited by-同舟云学术

Path Planning Method for Manipulators Based on Improved Twin Delayed Deep Deterministic Policy Gradient and RRT*

Published:2024-03-26 Issue:7 Volume:14 Page:2765
ISSN:2076-3417
Container-title:Applied Sciences
language:en
Short-container-title:Applied Sciences

Author:

Cai Ronggui¹,Li Xiao¹²

Affiliation:

1. School of Electronic Engineering and Automation, Guilin University of Electronic Technology, Guilin 541004, China

2. Key Laboratory of Intelligence Integrated Automation in Guangxi Universities, Guilin 541004, China

Abstract

This paper proposes a path planning framework that combines the experience replay mechanism from deep reinforcement learning (DRL) and rapidly exploring random tree star (RRT*), employing the DRL-RRT* as the path planning method for the manipulator. The iteration of the RRT* is conducted independently in path planning, resulting in a tortuous path and making it challenging to find an optimal path. The setting of reward functions in policy learning based on DRL is very complex and has poor universality, making it difficult to complete the task in complex path planning. Aiming at the insufficient exploration of the current deterministic policy gradient DRL algorithm twin delayed deep deterministic policy gradient (TD3), a stochastic policy was combined with TD3, and the performance was verified on the simulation platform. Furthermore, the improved TD3 was integrated with RRT* for performance analysis in two-dimensional (2D) and three-dimensional (3D) path planning environments. Finally, a six-degree-of-freedom manipulator was used to conduct simulation and experimental research on the manipulator.

Funder

Innovation Project of Guilin University of Electronic Technology (GUET) Graduate Education

Key Laboratory of Automatic Testing Technology and Instruments Foundation of Guangxi

Publisher

MDPI AG

Link

https://www.mdpi.com/2076-3417/14/7/2765/pdf

Reference37 articles.

1. Path Planning of Greenhouse Robot Based on Fusion of Improved A* Algorithm and Dynamic Window Approach;Lao;Nongye Jixie Xuebao/Trans. Chin. Soc. Agric. Mach.,2021

2. Probabilistic roadmaps for path planning in high-dimensional configuration spaces;Kavraki;IEEE Trans. Robot. Autom.,1996

3. MOD-RRT*: A Sampling-Based Algorithm for Robot Path Planning in Dynamic Environment;Qi;IEEE Trans. Ind. Electron.,2021

4. Viseras, A., Shutin, D., and Merino, L. (2017, January 24–28). Online information gathering using sampling-based planners and GPs: An information theoretic approach. Proceedings of the 2017 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), Vancouver, BC, Canada.

5. Gammell, J.D., Srinivasa, S.S., and Barfoot, T.D. (2014, January 14–18). Informed RRT*: Optimal Sampling-based Path Planning Focused via Direct Sampling of an Admissible Ellipsoidal Heuristic. Proceedings of the 2014 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), Chicago, IL, USA.