Affiliation:
1. Air Traffic Control and Navigation College, Air Force Engineering University, Xi’an 710051, China
2. Aviation Maintenance NCO School, Air Force Engineering University, Xinyang 464000, China
Abstract
Rapid and precise air operation mission planning is a key technology in unmanned aerial vehicles (UAVs) autonomous combat in battles. In this paper, an end-to-end UAV intelligent mission planning method based on deep reinforcement learning (DRL) is proposed to solve the shortcomings of the traditional intelligent optimization algorithm, such as relying on simple, static, low-dimensional scenarios, and poor scalability. Specifically, the suppression of enemy air defense (SEAD) mission planning is described as a sequential decision-making problem and formalized as a Markov decision process (MDP). Then, the SEAD intelligent planning model based on the proximal policy optimization (PPO) algorithm is established and a general intelligent planning architecture is proposed. Furthermore, three policy training tricks, i.e., domain randomization, maximizing policy entropy, and underlying network parameter sharing, are introduced to improve the learning performance and generalizability of PPO. Experiments results show that the model in this work is efficient and stable, and can be adapted to the unknown continuous high-dimensional environment. It can be concluded that the UAV intelligent mission planning model based on DRL has powerful intelligent planning performance, and provides a new idea for researching UAV autonomy.
Funder
Natural Science Foundation of Shaanxi Province
Subject
Multidisciplinary,General Computer Science
Reference33 articles.
1. Overview of air vehicle mission planning techniques;L. Shen;Acta Aeronautica et Astronautica Sinica,2014
2. Joint mission planning system;Joint mission planning system,2020
3. Research on Route Planning based on improved Ant Colony Algorithm
4. Tactic maneuver planning of loft delivery of laser-guided bomb with no offset;Y. Zhang;Systems Engineering and Electronics,2016
5. MOEA/D: A Multiobjective Evolutionary Algorithm Based on Decomposition
Cited by
9 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献