Robotic Manipulator in Dynamic Environment with SAC Combing Attention Mechanism and LSTM-Reference-Cited by-同舟云学术

Robotic Manipulator in Dynamic Environment with SAC Combing Attention Mechanism and LSTM

Published:2024-05-17 Issue:10 Volume:13 Page:1969
ISSN:2079-9292
Container-title:Electronics
language:en
Short-container-title:Electronics

Author:

Kuang Xinghong¹,Zhou Sucheng¹

Affiliation:

1. School of Engineering, Shanghai Ocean University, Shanghai 201306, China

Abstract

The motion planning task of the manipulator in a dynamic environment is relatively complex. This paper uses the improved Soft Actor Critic Algorithm (SAC) with the maximum entropy advantage as the benchmark algorithm to implement the motion planning of the manipulator. In order to solve the problem of insufficient robustness in dynamic environments and difficulty in adapting to environmental changes, it is proposed to combine Euclidean distance and distance difference to improve the accuracy of approaching the target. In addition, in order to solve the problem of non-stability and uncertainty of the input state in the dynamic environment, which leads to the inability to fully express the state information, we propose an attention network fused with Long Short-Term Memory (LSTM) to improve the SAC algorithm. We conducted simulation experiments and present the experimental results. The results prove that the use of fused neural network functions improved the success rate of approaching the target and improved the SAC algorithm at the same time, which improved the convergence speed, success rate, and avoidance capabilities of the algorithm.

Funder

National Key Research and Development Program of China

Publisher

MDPI AG

Link

https://www.mdpi.com/2079-9292/13/10/1969/pdf

Reference38 articles.

1. Bhuiyan, T., Kästner, L., Hu, Y., Kutschank, B., and Lambrecht, J. (2023, January 21–23). Deep-Reinforcement-Learning-based Path Planning for Industrial Robots using Distance Sensors as Observation. Proceedings of the 2023 8th International Conference on Control and Robotics Engineering (ICCRE), Niigata, Japan.

2. A robot arm digital twin utilising reinforcement learning;Matulis;Comput. Graph.,2021

3. Said, A., Talj, R., Francis, C., and Shraim, H. (2021, January 19–22). Local trajectory planning for autonomous vehicle with static and dynamic obstacles avoidance. Proceedings of the 2021 IEEE International Intelligent Transportation Systems Conference (ITSC), Indianapolis, IN, USA.

4. Palmieri, G., and Scoccia, C. (2021). Motion planning and control of redundant manipulators for dynamical obstacle avoidance. Machines, 9.

5. Azizi, M.R., Rastegarpanah, A., and Stolkin, R. (2021). Motion planning and control of an omnidirectional mobile robot in dynamic environments. Robotics, 10.