Missile guidance with assisted deep reinforcement learning for head-on interception of maneuvering target-Reference-Cited by-同舟云学术

Missile guidance with assisted deep reinforcement learning for head-on interception of maneuvering target

Published:2021-11-24 Issue: Volume: Page:
ISSN:2199-4536
Container-title:Complex & Intelligent Systems
language:en
Short-container-title:Complex Intell. Syst.

Author:

Li Weifan,Zhu Yuanheng^ORCID,Zhao Dongbin

Abstract

AbstractIn missile guidance, pursuit performance is seriously degraded due to the uncertainty and randomness in target maneuverability, detection delay, and environmental noise. In many methods, accurately estimating the acceleration of the target or the time-to-go is needed to intercept the maneuvering target, which is hard in an environment with uncertainty. In this paper, we propose an assisted deep reinforcement learning (ARL) algorithm to optimize the neural network-based missile guidance controller for head-on interception. Based on the relative velocity, distance, and angle, ARL can control the missile to intercept the maneuvering target and achieve large terminal intercept angle. To reduce the influence of environmental uncertainty, ARL predicts the target’s acceleration as an auxiliary supervised task. The supervised learning task improves the ability of the agent to extract information from observations. To exploit the agent’s good trajectories, ARL presents the Gaussian self-imitation learning to make the mean of action distribution approach the agent’s good actions. Compared with vanilla self-imitation learning, Gaussian self-imitation learning improves the exploration in continuous control. Simulation results validate that ARL outperforms traditional methods and proximal policy optimization algorithm with higher hit rate and larger terminal intercept angle in the simulation environment with noise, delay, and maneuverable target.

Funder

National Key Research and Development Program of China

strategic priority research program of chinese academy of sciences

youth innovation promotion association of the chinese academy of sciences

Publisher

Springer Science and Business Media LLC

Subject

General Earth and Planetary Sciences,General Environmental Science

Link

https://link.springer.com/content/pdf/10.1007/s40747-021-00577-6.pdf

Reference35 articles.

1. Anuse A, Vyas V (2016) A novel training algorithm for convolutional neural network. Complex Intell Syst 2(3):221–234

2. Caskey TR, Wasek JS, Franz AY (2018) Deter and protect: crime modeling with multi-agent learning. Complex Intell Syst 4(3):155–169

3. Chen Y, Zhao D, Li H (2019) Deep Kalman filter with optical flow for multiple object tracking. In: 2019 IEEE international conference on systems, man and cybernetics (SMC), pp 3036–3041

4. Coello CAC, Brambila SG, Gamboa JF, Tapia MGC, Gómez RH (2020) Evolutionary multiobjective optimization: open research areas and some challenges lying ahead. Complex Intell Syst 6(2):221–236

5. Ecoffet A, Huizinga J, Lehman J, Stanley KO, Clune J (2019) Go-explore: a new approach for hard-exploration problems. arXiv preprint. arXiv:1901.10995

Cited by 19 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Computational Intelligence Interception Guidance Law Using Online Off-Policy Integral Reinforcement Learning;Journal of Systems Engineering and Electronics;2024-08

2. Predictive air combat decision model with segmented reward allocation;Complex & Intelligent Systems;2024-07-22

3. Estimating Actual Size of Missile on Air using Object Detection Algorithm;2024 IEEE Space, Aerospace and Defence Conference (SPACE);2024-07-22

4. Deep Recurrent Reinforcement Learning for Intercept Guidance Law under Partial Observability;Applied Artificial Intelligence;2024-05-16

5. Exoatmospheric Evasion Guidance Law with Total Energy Limit via Constrained Reinforcement Learning;International Journal of Aeronautical and Space Sciences;2024-04-15