Affiliation:
1. Research Center of Satellite Technology, School of Astronautics,
Harbin Institute of Technology, Harbin, China.
2. Space Missions Engineering Lab, Department of Aerospace Science and Technology,
Politecnico di Milano, Milano, Italy.
Abstract
Aiming at the interception problem of noncooperative evader spacecraft adopting random maneuver strategy in one-to-one orbital pursuit–evasion problem, an interception strategy with decision-making training mechanism for the pursuer based on deep reinforcement learning is proposed. Its core purpose is to improve the success rate of interception in the environment with high uncertainty. First of all, a multi-impulse orbit transfer model of pursuer and evader is established, and a modular deep reinforcement learning training method is built. Second, an effective reward mechanism is proposed to train the pursuer to choose the impulse direction and impulse interval of the orbit transfer and to learn the successful interception strategy with the optimal fuel and time. Finally, with the evader taking a random maneuver decision in each episode of training, the trained decision-making strategy is applied to the pursuer, the corresponding interception success rate of which is further analyzed. The results show that the pursuer trained can obtain universal and variable interception strategy. In each round of pursuit–evasion, with random maneuver strategy of the evader, the pursuer can adopt similar optimal decisions to deal with high-dimensional environments and thoroughly random state space, maintaining high interception success rate.
Publisher
American Association for the Advancement of Science (AAAS)
Reference23 articles.
1. Li YL. The attack orbit optimization of space attack and defense. Harbin, China: Harbin Engineering University; 2018.
2. Solution space exploration of low-thrust minimum-time trajectory optimization by combining two homotopies;Jingrui Z;Automatica,2023
3. Optimization of attacking orbit for interception satellite with low continuous thrust[J];Zhao L;Opt Precis Eng,2016
4. Shi M. Research on orbital optimization and control of interception satellite. Harbin, China: Harbin Institute of Technology; 2015.
5. A fuzzy controller for terminal approach of autonomous rendezvous and docking with non-cooperative target;Chen T;J Astronaut,2006
Cited by
6 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献