Actor–critic-based decision-making method for the artificial intelligence commander in tactical wargames-Reference-Cited by-同舟云学术

Actor–critic-based decision-making method for the artificial intelligence commander in tactical wargames

Published:2020-09-14 Issue: Volume: Page:154851292095454
ISSN:1548-5129
Container-title:The Journal of Defense Modeling and Simulation: Applications, Methodology, Technology
language:en
Short-container-title:Journal of Defense Modeling & Simulation

Author:

Zhang Junfeng¹²^ORCID,Xue Qing¹

Affiliation:

1. Military Exercise and Training Center, Army Academy of Armored Forces, China

2. Information Technology Department, Troop No. 32370 of PLA, China

Abstract

In a tactical wargame, the decisions of the artificial intelligence (AI) commander are critical to the final combat result. Due to the existence of fog-of-war, AI commanders are faced with unknown and invisible information on the battlefield and lack of understanding of the situation, and it is difficult to make appropriate tactical strategies. The traditional knowledge rule-based decision-making method lacks flexibility and autonomy. How to make flexible and autonomous decision-making when facing complex battlefield situations is a difficult problem. This paper aims to solve the decision-making problem of the AI commander by using the deep reinforcement learning (DRL) method. We develop a tactical wargame as the research environment, which contains built-in script AI and supports the machine–machine combat mode. On this basis, an end-to-end actor–critic framework for commander decision making based on the convolutional neural network is designed to represent the battlefield situation and the reinforcement learning method is used to try different tactical strategies. Finally, we carry out a combat experiment between a DRL-based agent and a rule-based agent in a jungle terrain scenario. The result shows that the AI commander who adopts the actor–critic method successfully learns how to get a higher score in the tactical wargame, and the DRL-based agent has a higher winning ratio than the rule-based agent.

Publisher

SAGE Publications

Subject

Engineering (miscellaneous),Modeling and Simulation

Link

http://journals.sagepub.com/doi/pdf/10.1177/1548512920954542

Reference34 articles.

1. Mastering the game of Go with deep neural networks and tree search

Cited by 3 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Applying Deep Reinforcement Learning to Train AI Agents in a Wargaming Framework;SoutheastCon 2024;2024-03-15

2. Technological Management Innovation: A Combination of Technology Roadmap and Wargame;2023 IEEE International Conference on Networking, Sensing and Control (ICNSC);2023-10-25

3. Hierarchical Architecture for Multi-Agent Reinforcement Learning in Intelligent Game;2022 International Joint Conference on Neural Networks (IJCNN);2022-07-18