An Obstacle-Avoidance Motion Planning Method for Redundant Space Robot via Reinforcement Learning-Reference-Cited by-同舟云学术

An Obstacle-Avoidance Motion Planning Method for Redundant Space Robot via Reinforcement Learning

Published:2023-02-08 Issue:2 Volume:12 Page:69
ISSN:2076-0825
Container-title:Actuators
language:en
Short-container-title:Actuators

Author:

Huang Zeyuan¹^ORCID,Chen Gang¹,Shen Yue¹^ORCID,Wang Ruiquan¹,Liu Chuankai²,Zhang Long³^ORCID

Affiliation:

1. School of Automation, Beijing University of Posts and Telecommunications, Beijing 100876, China

2. Beijing Aerospace Control Center, Beijing 100094, China

3. Technology and Engineering Center for Space Utilization, Chinese Academy of Sciences, Beijing 100094, China

Abstract

On-orbit operation tasks require the space robot to work in an unstructured dynamic environment, where the end-effector’s trajectory and obstacle avoidance need to be guaranteed simultaneously. To ensure the completability and safety of the tasks, this paper proposes a new obstacle-avoidance motion planning method for redundant space robots via reinforcement learning (RL). First, the motion planning framework, which combines RL with the null-space motion for redundant space robots, is proposed according to the decomposition of joint motion. Second, the RL model for null-space obstacle avoidance is constructed, where the RL agent’s state and reward function are defined independent of the specific information of obstacles so that it can adapt to dynamic environmental changes. Finally, a curriculum learning-based training strategy for RL agents is designed to improve sample efficiency, training stability, and obstacle-avoidance performance. The simulation shows that the proposed method realizes reactive obstacle avoidance while maintaining the end-effector’s predetermined trajectory, as well as the adaptability to unstructured dynamic environments and robustness to the space robot’s dynamic parameters.

Funder

BUPT Action Plan to Enhance Capacity for Scientific and Technological Innovation

BUPT Excellent Ph.D. Students Foundation

National Natural Science Foundation of China

Science and Technology Foundation of State Key Laboratory

Publisher

MDPI AG

Subject

Control and Optimization,Control and Systems Engineering

Link

https://www.mdpi.com/2076-0825/12/2/69/pdf

Reference37 articles.