Research on Air Combat Maneuver Decision-Making Method Based on Reinforcement Learning

Published:2018-10-27 Issue:11 Volume:7 Page:279
ISSN:2079-9292
Container-title:Electronics
language:en
Short-container-title:Electronics

Author:

Zhang Xianbing, Liu Guoqing, Yang Chaojie, Wu Jiang

Abstract

With the development of information technology, air combat is becoming increasingly intelligent, and the demand for automated intelligent decision-making systems is growing. Based on the characteristics of over-the-horizon air combat, this paper constructs an over-the-horizon air combat training environment, which includes aircraft modeling, air combat scenario design, enemy aircraft strategy design, and reward and punishment signal design. To improve the efficiency with which the reinforcement learning algorithm explores the strategy space, this paper proposes a heuristic Q-Network method that integrates expert experience, using it as a heuristic signal to guide the search process while combining heuristic exploration with random exploration. For the over-the-horizon air combat maneuver decision problem, the heuristic Q-Network method is used to train the neural network model in this training environment. Through continuous interaction with the environment, the model learns the air combat maneuver strategy on its own. Simulation experiments verify the efficiency of the heuristic Q-Network method and the effectiveness of the resulting air combat maneuver strategy.
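
The abstract states that heuristic exploration guided by expert experience is combined with random exploration, but it does not give the mixing rule. The following is only a minimal sketch of one way such a combination could look, not the authors' implementation. The function name `select_action`, the parameters `p_heuristic` and `p_random`, and the idea of a single expert-suggested action per step are all assumptions made for illustration.

```python
import random


def select_action(q_values, expert_action, p_heuristic, p_random, rng=random):
    """Pick an action index by mixing three behaviours (hypothetical rule).

    With probability p_heuristic, follow the expert's suggested action;
    with probability p_random, pick an action uniformly at random;
    otherwise act greedily on the current Q-value estimates.
    """
    if p_heuristic < 0.0 or p_random < 0.0 or p_heuristic + p_random > 1.0:
        raise ValueError("probabilities must be non-negative and sum to at most 1")

    u = rng.random()  # uniform draw in [0.0, 1.0)
    if u < p_heuristic:
        return expert_action  # heuristic exploration
    if u < p_heuristic + p_random:
        return rng.randrange(len(q_values))  # random exploration
    # Exploitation: index of the largest Q-value estimate.
    return max(range(len(q_values)), key=q_values.__getitem__)


# Example with three candidate maneuvers and an expert that prefers index 2.
q_estimates = [0.1, 0.7, 0.2]
greedy_choice = select_action(q_estimates, expert_action=2, p_heuristic=0.0, p_random=0.0)
expert_choice = select_action(q_estimates, expert_action=2, p_heuristic=1.0, p_random=0.0)
```

In a sketch like this, both probabilities would typically be decayed over training so that the learned Q-values gradually take over from the expert signal; whether the paper uses such a schedule is not stated in the abstract.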

Publisher

MDPI AG

Subject

Electrical and Electronic Engineering, Computer Networks and Communications, Hardware and Architecture, Signal Processing, Control and Systems Engineering

Link

http://www.mdpi.com/2079-9292/7/11/279/pdf

References: 18 articles.

1. On Applied Nonlinear and Bilevel Programming or Pursuit-Evasion Games

2. Game theory for automated maneuvering during air-to-air combat

3. Modeling Pilot's Sequential Maneuvering Decisions by a Multistage Influence Diagram

Cited by 50 articles.

1. Deep Recurrent Reinforcement Learning for Intercept Guidance Law under Partial Observability;Applied Artificial Intelligence;2024-05-16

2. Optimal confrontation position selecting games model and its application to one-on-one air combat;Defence Technology;2024-01

3. UAV Interception and Confrontation Maneuver Decision-Making Based on Reinforcement Learning;2023 3rd International Conference on Electronic Information Engineering and Computer (EIECT);2023-11-17

4. Dual-UAVs Maneuvering Strategy Generation Algorithm Based on Cooperative Reward Mechanism and MATD3;2023 11th International Conference on Control, Mechatronics and Automation (ICCMA);2023-11-01

5. An Evolutionary Reinforcement Learning Approach for Autonomous Maneuver Decision in One-to-One Short-Range Air Combat;2023 IEEE/AIAA 42nd Digital Avionics Systems Conference (DASC);2023-10-01