Affiliation:
1. University of Colorado, Boulder, Colorado 80303
Abstract
This work explores single-agent reinforcement learning for the multi-satellite agile Earth-observing scheduling problem. The objective of the problem is to maximize the weighted sum of imaging targets collected and downlinked while avoiding resource constraint violations on board the spacecraft. To avoid the computational complexity associated with multi-agent deep reinforcement learning while creating a robust and scalable solution, a policy is trained in a single-satellite environment. This policy is then deployed on board each satellite in a Walker-delta constellation. A global set of targets is distributed to each satellite based on target access. The satellites communicate with one another to determine whether an imaging target has been imaged or downlinked. Free communication, line-of-sight communication, and no communication are explored to determine how the communication assumptions and constellation design impact performance. Free communication is shown to produce the best performance, and no communication is shown to produce the worst performance. Line-of-sight communication performance is shown to depend heavily on the design of the constellation and how frequently the satellites can communicate with one another. To explore how higher-level coordination can impact performance, a centralized mixed-integer programming optimization approach to global target distribution is explored and compared to a decentralized approach. A genetic algorithm is also implemented for comparison purposes, and the proposed method is shown to achieve higher reward on average at a fraction of the computational cost.
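The decentralized target-distribution step described in the abstract, in which each satellite receives the subset of the global target set it can access, can be sketched as follows. This is a minimal illustration, not the paper's implementation: the target IDs, satellite IDs, and `access` structure are hypothetical placeholders, whereas the paper derives access from orbital geometry.

```python
# Hedged sketch of access-based target distribution: each target in the
# global set is assigned to every satellite that can access it. The
# access map below is invented for illustration only.

def distribute_targets(global_targets, access):
    """Map each satellite ID to the ordered list of targets it can access.

    global_targets: iterable of target IDs (global target set)
    access: dict mapping satellite ID -> set of accessible target IDs
    """
    assignment = {}
    for sat, accessible in access.items():
        # Preserve the global ordering so every satellite ranks
        # shared targets consistently.
        assignment[sat] = [t for t in global_targets if t in accessible]
    return assignment

# Hypothetical 3-satellite, 4-target example
access = {
    "sat0": {"t0", "t1"},
    "sat1": {"t1", "t2"},
    "sat2": {"t3"},
}
assignment = distribute_targets(["t0", "t1", "t2", "t3"], access)
print(assignment["sat1"])  # ['t1', 't2']
```

Note that targets visible to multiple satellites (here "t1") are replicated across their assignment lists; deconflicting such overlaps is what the abstract's inter-satellite communication and the centralized mixed-integer programming alternative address.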
Funder
National Aeronautics and Space Administration
Air Force Research Laboratory
Publisher
American Institute of Aeronautics and Astronautics (AIAA)
Subject
Space and Planetary Science,Aerospace Engineering
Cited by
2 articles.