Relative control of an underactuated spacecraft using reinforcement learning-Reference-Cited by-同舟云学术

Relative control of an underactuated spacecraft using reinforcement learning

Published:2020-12-10 Issue:4 Volume:2020 Page:43-54
ISSN:1561-9184
Container-title:Technical mechanics
language:
Short-container-title:Teh. Meh.

Author:

Khoroshylov S.V.^ORCID, ,Redka M.O.,

Abstract

The aim of the article is to approximate optimal relative control of an underactuated spacecraft using reinforcement learning and to study the influence of various factors on the quality of such a solution. In the course of this study, methods of theoretical mechanics, control theory, stability theory, machine learning, and computer modeling were used. The problem of in-plane spacecraft relative control using only control actions applied tangentially to the orbit is considered. This approach makes it possible to reduce the propellant consumption of reactive actuators and to simplify the architecture of the control system. However, in some cases, methods of the classical control theory do not allow one to obtain acceptable results. In this regard, the possibility of solving this problem by reinforcement learning methods has been investigated, which allows designers to find control algorithms close to optimal ones as a result of interactions of the control system with the plant using a reinforcement signal characterizing the quality of control actions. The well-known quadratic criterion is used as a reinforcement signal, which makes it possible to take into account both the accuracy requirements and the control costs. A search for control actions based on reinforcement learning is made using the policy iteration algorithm. This algorithm is implemented using the actor–critic architecture. Various representations of the actor for control law implementation and the critic for obtaining value function estimates using neural network approximators are considered. It is shown that the optimal control approximation accuracy depends on a number of features, namely, an appropriate structure of the approximators, the neural network parameter updating method, and the learning algorithm parameters. The investigated approach makes it possible to solve the considered class of control problems for controllers of different structures. Moreover, the approach allows the control system to refine its control algorithms during the spacecraft operation.

Publisher

National Academy of Sciences of Ukraine (Co. LTD Ukrinformnauka)

Reference22 articles.

1. 1. MacIsaac D. Docking at the International Space Station. Phys. Teach. 2014. V. 52. No. 126.

2. 2. Campbell M., Fullmer R. R., Hall C. D. The ION-F formation flying experiments. Advances in the Astronautical Sciences. 2000. V. 105. Pp. 135-149.

3. 3. Smith G. W., DeRocher W. L. Jr. Orbital servicing and remotely manned systems. Mechanism and Machine Theory. 1977. V. 12. Pp. 65-76.

4. 4. Alpatov A. P., Khoroshylov S. V., Maslova A. I. Contactless De-Orbiting of Space Debris by the Ion Beam. Dynamics and Control. - Kyiv: Akademperiodyka, 2019. 170 pp.

5. 5. Vassar R. H., Sherwood R. B. Formationkeeping for a pair of satellites in a circular orbit. Journal of Guidance, Control, and Dynamics. 1985. V. 8(2). Pp. 235-242.

Cited by 3 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. SPACECRAFT RELATIVE ON-OFF CONTROL VIA REINFORCEMEN T LEARNING;Kosmìčna nauka ì tehnologìâ;2024-04-29

2. Problems in the system analysis of space activities in Ukraine. Rocket and spacecraft dynamics and control;Technical mechanics;2021-06-29

3. Deep learning for spacecraft guidance, navigation, and control;Kosmìčna nauka ì tehnologìâ;2021