Velocity Control of a Multi-Motion Mode Spherical Probe Robot Based on Reinforcement Learning-Reference-Cited by-同舟云学术

Velocity Control of a Multi-Motion Mode Spherical Probe Robot Based on Reinforcement Learning

Published:2023-07-15 Issue:14 Volume:13 Page:8218
ISSN:2076-3417
Container-title:Applied Sciences
language:en
Short-container-title:Applied Sciences

Author:

Ma Wenke¹²,Li Bingyang²³,Cao Yuxue⁴,Wang Pengfei²,Liu Mengyue²,Chang Chenyang³,Peng Shigang¹²

Affiliation:

1. Qian Xuesen Laboratory of Space Technology, China Academy of Space Technology, Beijing 100094, China

2. China Academy of Aerospace Science and Innovation, Beijing 102600, China

3. College of Engineering, Peking University, Beijing 100871, China

4. Beijing Institute of Control Engineering, Beijing 100190, China

Abstract

As deep space exploration tasks become increasingly complex, the mobility and adaptability of traditional wheeled or tracked probe robots with high functional density are constrained in harsh, dangerous, or unknown environments. A practical solution to these challenges is designing a probe robot for preliminary exploration in unknown areas, which is characterized by robust adaptability, simple structure, light weight, and minimal volume. Compared to the traditional deep space probe robot, the spherical robot with a geometric, symmetrical structure shows better adaptability to the complex ground environment. Considering the uncertain detection environment, the spherical robot should brake rapidly after jumping to avoid reentering obstacles. Moreover, since it is equipped with optical modules for deep space exploration missions, the spherical robot must maintain motion stability during the rolling process to ensure the quality of photos and videos captured. However, due to the nonlinear coupling and parameter uncertainty of the spherical robot, it is tedious to adjust controller parameters. Moreover, the adaptability of controllers with fixed parameters is limited. This paper proposes an adaptive proportion–integration–differentiation (PID) control method based on reinforcement learning for the multi-motion mode spherical probe robot (MMSPR) with rolling and jumping. This method uses the soft actor–critic (SAC) algorithm to adjust the parameters of the PID controller and introduces a switching control strategy to reduce static error. As the simulation results show, this method can facilitate the MMSPR’s convergence within 0.02 s regarding motion stability. In addition, in terms of braking, it enables an MMSPR with random initial speed brake within a convergence time of 0.045 s and a displacement of 0.0013 m. Compared with the PID method with fixed parameters, the braking displacement of the MMSPR is reduced by about 38%, and the convergence time is reduced by about 20%, showing better universality and adaptability.

Funder

Technology 173 Program Technical Field Fund

Publisher

MDPI AG

Subject

Fluid Flow and Transfer Processes,Computer Science Applications,Process Chemistry and Technology,General Engineering,Instrumentation,General Materials Science

Link

https://www.mdpi.com/2076-3417/13/14/8218/pdf

Reference33 articles.

1. Design and Kinematics of Mechanically Coupled Two Identical Spherical Robots;Sagsoz;J. Intell. Robot. Syst.,2023

2. Li, M., Sun, H., Ma, L., Gao, P., Huo, D., Wang, Z., and Sun, P. (2023). Special spherical mobile robot for planetary surface exploration: A review. Int. J. Adv. Robot. Syst., 20.

3. Chi, X., and Zhan, Q. (2021). Design and modelling of an amphibious spherical robot attached with assistant fins. Appl. Sci., 11.

4. Design, Implementation and Control of an Amphibious Spherical Robot;Shi;J. Bionic Eng.,2022

5. Design and Development of Spherical Spy Robot for Surveillance Operation;Rangapur;Procedia Comput. Sci.,2020

Cited by 1 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Optimizing Subway Train Operation With Hierarchical Adaptive Control Approach;IEEE Access;2023