Inverse kinematics solution and control method of 6-degree-of-freedom manipulator based on deep reinforcement learning

Published:2024-05-30 Issue:1 Volume:14 Page:
ISSN:2045-2322
Container-title:Scientific Reports
language:en
Short-container-title:Sci Rep

Author:

Zhao Chengyi, Wei Yimin, Xiao Junfeng, Sun Yong, Zhang Dongxing, Guo Qiuquan, Yang Jun

Abstract

The advent of Industry 4.0 has significantly promoted the field of intelligent manufacturing, which is facilitated by the development of emerging technologies. Robot technology and robot intelligence methods have developed rapidly and been widely applied. Manipulators are widely used in industry, and their control is a crucial research topic. The inverse kinematics solution of a manipulator is an important part of manipulator control: it calculates the joint angles required for the end effector to reach a desired position and posture. Traditional inverse kinematics solution algorithms often generalize poorly, and iterative methods suffer from heavy computation and long solution times. This paper proposes a reinforcement learning-based inverse kinematics solution algorithm, called the MAPPO-IK algorithm. The algorithm trains the manipulator agent using the MAPPO algorithm; its reward mechanism calculates, in real time, the difference between the end effector state of the manipulator and the target posture, taking both Gaussian distance and cosine distance into account. Experimental comparative analysis verifies the feasibility, computational efficiency, and superiority of this reinforcement learning algorithm. Compared with traditional inverse kinematics solution algorithms, this method generalizes well, supports real-time computation, and yields a unique solution. Reinforcement learning algorithms adapt better to complex environments and can handle different sudden situations in different environments. The algorithm also offers path planning, intelligent obstacle avoidance, and other advantages when dynamically processing complex environmental scenes.
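The abstract says the reward mechanism considers Gaussian distance and cosine distance but gives no formula. The sketch below is only one plausible reading, not the authors' implementation: it assumes the Gaussian term is a Gaussian kernel of the end effector's position error and the cosine term is the cosine similarity between the end effector's approach direction and the target direction. The function names, the weights `w_pos` and `w_ori`, and the width `sigma` are all hypothetical.

```python
import math


def gaussian_position_term(position, target_position, sigma=0.1):
    """Gaussian kernel of the Euclidean position error (assumed form).

    Equals 1.0 when the end effector is at the target and decays towards 0
    as the error grows; sigma (hypothetical) sets how fast it decays.
    """
    squared_error = sum((p - t) ** 2 for p, t in zip(position, target_position))
    return math.exp(-squared_error / (2.0 * sigma ** 2))


def cosine_orientation_term(direction, target_direction):
    """Cosine similarity between two direction vectors, in [-1, 1]."""
    dot = sum(d * t for d, t in zip(direction, target_direction))
    norm = math.sqrt(sum(d * d for d in direction)) * math.sqrt(
        sum(t * t for t in target_direction)
    )
    if norm == 0.0:
        return 0.0  # undefined for a zero vector; treat as "no alignment"
    return dot / norm


def pose_reward(position, direction, target_position, target_direction,
                sigma=0.1, w_pos=0.5, w_ori=0.5):
    """Weighted sum of the two terms (weights are illustrative assumptions)."""
    return (
        w_pos * gaussian_position_term(position, target_position, sigma)
        + w_ori * cosine_orientation_term(direction, target_direction)
    )


if __name__ == "__main__":
    target_p, target_d = (0.4, 0.0, 0.3), (0.0, 0.0, 1.0)
    print(pose_reward((0.4, 0.0, 0.3), (0.0, 0.0, 1.0), target_p, target_d))
    print(pose_reward((0.4, 0.0, 1.3), (0.0, 0.0, -1.0), target_p, target_d))
```

Under these assumptions the reward is largest when the end effector reaches the target pose and turns negative when it is far away and pointing in the opposite direction, which gives the agent a dense signal at every step instead of a sparse success flag.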

Funder

Basic and Applied Basic Research Foundation of Guangdong Province

Science, Technology and Innovation Commission of Shenzhen Municipality

Publisher

Springer Science and Business Media LLC

Link

https://www.nature.com/articles/s41598-024-62948-6.pdf

References: 37 articles.

1. Lin, G. et al. An inverse kinematics solution for a series-parallel hybrid banana-harvesting robot based on deep reinforcement learning. Agronomy 12(9), 2157 (2022).

2. Malik, A. et al. A deep reinforcement-learning approach for inverse kinematics solution of a high degree of freedom robotic manipulator. Robotics 11(2), 44 (2022).

3. Zhengyong, F. & Yinfu, Z. A fast training method for intelligent control of robot arm based on deep reinforcement learning. Comput. Eng. 48(8), 8 (2022).

4. Stifter, S. Algebraic methods for computing inverse kinematics. J. Intell. Robot. Syst. 11, 79–89 (1994).

5. Lee, C. S. G. & Ziegler, M. Geometric approach in solving inverse kinematics of PUMA robots. IEEE Trans. Aerosp. Electron. Syst. 6, 695–706 (1984).