Intelligent maneuver strategy for hypersonic vehicles in three-player pursuit-evasion games via deep reinforcement learning

Published: 2024-02-14
Volume: 18
ISSN: 1662-453X
Journal: Frontiers in Neuroscience
Journal abbreviation: Front. Neurosci.

Authors:

Yan Tian, Jiang Zijian, Li Tong, Gao Mengjing, Liu Can

Abstract

Motivated by the rapid development of collaborative interception technology against hypersonic vehicles (HVs), this paper designs an intelligent maneuver strategy based on deep reinforcement learning (DRL) that enables an HV to evade collaborative interception by two interceptors. A carefully designed collaborative interception strategy significantly increases the uncertainty and difficulty of evasion and further compresses the window for maneuvering. The paper therefore adopts the twin delayed deep deterministic policy gradient (TD3) algorithm, which operates on a continuous action space, and makes targeted improvements that incorporate deep neural networks so that the agent learns the maneuver strategy and evades successfully. Focusing on the time-coordinated interception strategy of the two interceptors, the three-player pursuit-evasion (PE) problem is modeled as a Markov decision process, and a double training strategy is proposed to handle both interceptors. In the reward function used for training, an energy-saving factor is introduced to trade off miss distance against energy consumption. In addition, a regression neural network is introduced into the deep neural network of TD3 to improve the generalization of the learned maneuver strategy. Finally, numerical simulations verify that the improved TD3 algorithm effectively evades the collaborative interception of two interceptors in difficult scenarios, and confirm its improvements in convergence speed, generalization, and energy saving.
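
The abstract does not give the form of the reward, so the following is only a minimal sketch of how an energy-saving factor could weigh miss distance against energy consumption. The function name `episode_reward`, the parameters `energy_factor` and `safe_miss_m`, the saturating evasion term, and the quadratic control cost are all assumptions made for illustration and are not taken from the paper.

```python
def episode_reward(miss_distances_m, control_efforts,
                   energy_factor=0.1, safe_miss_m=5.0):
    """Hypothetical terminal reward: evasion term minus weighted energy term.

    miss_distances_m: closest-approach distance to each interceptor (meters).
    control_efforts:  normalized control commands issued during the episode.
    energy_factor:    assumed energy-saving weight (0 ignores energy use).
    safe_miss_m:      assumed miss distance beyond which evasion is "safe".
    """
    # With two interceptors, evasion is only as good as the closer miss.
    worst_miss = min(miss_distances_m)
    # Evasion term grows with miss distance and saturates at 1 once safe.
    evasion_term = min(worst_miss / safe_miss_m, 1.0)
    # Quadratic control cost as a simple stand-in for energy consumption.
    energy_term = sum(u * u for u in control_efforts)
    return evasion_term - energy_factor * energy_term


if __name__ == "__main__":
    # Same manoeuvre scored with and without the energy penalty.
    efforts = [0.5, 0.5]
    print(episode_reward([10.0, 8.0], efforts, energy_factor=0.0))
    print(episode_reward([10.0, 8.0], efforts, energy_factor=0.1))
```

In this sketch, raising `energy_factor` makes large maneuvers less attractive once the miss distance is already safe, which is the kind of trade-off the abstract describes.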

Publisher

Frontiers Media SA

References: 36 articles.

1. An image caption model based on attention mechanism and deep reinforcement learning; Bai; Front. Neurosci., 2023

2. Autonomous trajectory planning method for hypersonic vehicles in glide phase based on DDPG algorithm; Bao; Proc. Inst. Mech. Eng. Part G J. Aerospace Eng., 2023

3. A deep reinforcement learning-based approach to onboard trajectory generation for hypersonic vehicles; Bao; Aeronaut. J., 2023

4. A two-pursuer one-evader game with equal speed and finite capture radius; Casini; J. Intell. Robot. Syst., 2022

5. Trust region policy optimization guidance algorithm for intercepting maneuvering target; Chen; Acta Aeronautica et Astronautica Sin., 2023

Cited by 1 article.

1. Deep Reinforcement Learning-Based Differential Game Guidance Law against Maneuvering Evaders; Aerospace; 2024-07-06