Multiphase Autonomous Docking via Model-Based and Hierarchical Reinforcement Learning-Reference-Cited by-同舟云学术

Multiphase Autonomous Docking via Model-Based and Hierarchical Reinforcement Learning

Published:2024-07 Issue:4 Volume:61 Page:993-1005
ISSN:0022-4650
Container-title:Journal of Spacecraft and Rockets
language:en
Short-container-title:Journal of Spacecraft and Rockets

Author:

Aborizk Anthony¹^ORCID,Fitz-Coy Norman¹^ORCID

Affiliation:

1. University of Florida, Gainesville, Florida 32611

Abstract

With the rise of traffic around Earth’s orbit, spacecraft mission designs have placed an unprecedented demand on the capabilities of autonomous systems. In the early 2000s, the state-of-the-art autonomous spacecraft controllers were designed for static and uncluttered environments. A little over a decade later, the challenges facing spacecraft autonomy now include cluttered, dynamic environments with time-varying constraints, logical modes, fault tolerances, uncertain dynamics, and complex maneuvers. With this rise in complexity, many areas of research have been investigating more experimental control strategies, such as reinforcement learning (RL), as a potential solution to this problem. The research presented herein aims to expand on efforts to quantify the use of RL in autonomous rendezvous, proximity operations, and docking (ARPOD) environments, with consideration to the inherent drawbacks of the more common algorithms present in the field. We present hierarchical model-based RL as a solution to an autonomous docking problem. This algorithm can learn satellite parameters, extrapolate trajectory information, and learn uncertain dynamics via data collection. By using gradient-free model predictive control logic, the algorithm can handle nondifferentiable objectives and complex constraints. Lastly, the hierarchical structure demonstrates an ability to generate feasible trajectories in the presence of integrated third-party subcontrollers commonly found in spacecraft. This study highlights the ability of the hierarchical algorithm to combine and manipulate third-party subpolicies to achieve trajectories not previously trained on.

Funder

National Science Foundation Graduate Research Fellowship Program

Publisher

American Institute of Aeronautics and Astronautics (AIAA)

Link

https://arc.aiaa.org/doi/pdf/10.2514/1.A35683

Reference19 articles.

1. National Research Council, “NASA Space Technology Roadmaps and Priorities: Restoring NASA’s Technological Edge and Paving the Way for a New Era in Space,” National Academies Press, Washington, D.C., 2012. 10.17226/13354

2. “ESA’s Annual Space Environment Report,” Tech. Rept. GEN-DB-LOG-00288-OPS-SD, European Space Agency, Darmstadt, Germany, 2022.

3. Spacecraft Trajectory Planning with Avoidance Constraints Using Mixed-Integer Linear Programming

4. Guidance Navigation and Control for Autonomous Multiple Spacecraft Assembly: Analysis and Experimentation

5. JewisonC. “Guidance and Control for Multi-Stage Rendezvous and Docking Operations in the Presence of Uncertainty,” Ph.D. Dissertation, Massachusetts Inst. of Technology, Boston, 2017.