Author:
Chen Keqin,Lei Jianchang,Li Bin
Abstract
Abstract
For the pursuit-evasion (PE) game, this paper proposes an evasion strategy with coordinated control of angle of attack, bank angle and body morphing control using deep reinforcement learning. Considering the evasion and ballistic regression of the aircraft, the specified miss distance (SMD) and residual energy are used as the optimization objectives, to acquire the optimal control strategy against the encounter with pursuer in the terminal guidance phase. For the problem of sparse rewards, reward reshaping cannot be performed for this problem, we modify DQN algorithm with the mechanism of Monte-Carlo reinforcement learning to improve the sampling efficiency and realize the end-to-end learning. Finally, the linear analytical solution of the problem based on SMD is analyzed theoretically. With it, the strategy obtained by reinforcement learning is compared and explained.
Subject
General Physics and Astronomy
Reference27 articles.
1. Analysis of optimal evasive maneuvers based on a linearized two-dimensional kinematic model;Shinar;Journal of Aircraft,2012
2. Optimal evasive tactics against a proportional navigation missile with time delay;Slater;Journal of Spacecraft and Rockets,1973
3. Maneuver penetration strategy of near space vehicle based on differential game;Wang;Acta Aeronautica et Astronautica Sinica,2020
4. Optimal planar evasive aircraft maneuvers against proportional navigation missiles;Ong;Journal of Guidance, Control, and Dynamics,1996
5. Optimal interception of evasive missile warheads: Numerical solution of the differential game;Pontani;Journal of guidance, control, and dynamics,2008