Learning Macromanagement in Starcraft by Deep Reinforcement Learning

Published: 2021-05-11; Volume: 21; Issue: 10; Page: 3332
ISSN: 1424-8220
Journal: Sensors
Language: en

Author:

Huang Wenzhen, Yin Qiyue, Zhang Junge, Huang Kaiqi

Abstract

StarCraft is a real-time strategy game that provides a complex environment for AI research. Macromanagement, i.e., selecting appropriate units to build depending on the current state, is one of the most important problems in this game. To reduce the need for expert knowledge and improve the coordination of the overall bot, we use reinforcement learning (RL) to tackle the problem of macromanagement. We propose a novel deep RL method, Mean Asynchronous Advantage Actor-Critic (MA3C), which computes the approximate expected policy gradient instead of the gradient of a sampled action to reduce the variance of the gradient, and which encodes the history queue with a recurrent neural network to handle imperfect information. The experimental results show that MA3C achieves a very high win rate, approximately 90%, against the weaker opponents and improves the win rate by about 30% against the stronger opponents. We also propose a novel method to visualize and interpret the policy learned by MA3C. Combining the visualized results with snapshots of games, we find that the learned macromanagement not only adapts to the game rules and the policy of the opponent bot, but also cooperates well with the other modules of MA3C-Bot.
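
The variance argument in the abstract can be illustrated with a small sketch. This is not the authors' MA3C implementation: it assumes a softmax policy over a handful of discrete build actions and assumes that a per-action advantage estimate is already available (the abstract does not say how MA3C approximates it). All names and numbers here (`sampled_policy_gradient`, `expected_policy_gradient`, the example logits and advantages) are hypothetical. The sketch only shows that averaging the gradient over all actions gives the same mean as the one-sample estimate, without the sampling noise.

```python
import math
import random


def softmax(logits):
    """Numerically stable softmax over a list of logits."""
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def sampled_policy_gradient(logits, advantages, rng):
    """One-sample estimate: A(a) * grad_logits log pi(a), with a drawn from pi."""
    pi = softmax(logits)
    a = rng.choices(range(len(pi)), weights=pi)[0]
    # For a softmax policy, d log pi(a) / d logit_i = 1[i == a] - pi_i.
    return [advantages[a] * ((1.0 if i == a else 0.0) - p)
            for i, p in enumerate(pi)]


def expected_policy_gradient(logits, advantages):
    """Exact expectation over actions: sum_a pi(a) * A(a) * grad_logits log pi(a)."""
    pi = softmax(logits)
    baseline = sum(p * adv for p, adv in zip(pi, advantages))
    # The sum over actions simplifies to pi_i * (A_i - sum_a pi_a * A_a).
    return [p * (adv - baseline) for p, adv in zip(pi, advantages)]


# Hypothetical values: four build actions with assumed advantage estimates.
rng = random.Random(0)
logits = [0.5, -0.2, 1.0, 0.0]
advantages = [1.0, -0.5, 0.3, 2.0]

expected = expected_policy_gradient(logits, advantages)

# Average many one-sample estimates; the mean should approach `expected`.
n = 20000
totals = [0.0] * len(logits)
for _ in range(n):
    g = sampled_policy_gradient(logits, advantages, rng)
    totals = [t + x for t, x in zip(totals, g)]
mean_sampled = [t / n for t in totals]
```

A single call to `sampled_policy_gradient` varies from draw to draw, while `expected_policy_gradient` returns the same vector every time for the same inputs; that difference is the variance reduction the abstract refers to, under the assumptions stated above.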

Funder

National Natural Science Foundation of China

Beijing Nova Program of Science and Technology

Publisher

MDPI AG

Subject

Electrical and Electronic Engineering; Biochemistry; Instrumentation; Atomic and Molecular Physics, and Optics; Analytical Chemistry

Link

https://www.mdpi.com/1424-8220/21/10/3332/pdf

References: 40 articles.

1. Grandmaster level in StarCraft II using multi-agent reinforcement learning

2. A Survey of Real-Time Strategy Game AI Research and Competition in StarCraft

3. MSC: A Dataset for Macro-Management in StarCraft II; Wu; arXiv, 2017

4. Starcraft bots and competitions; Churchill, 2016

Cited by 2 articles.

1. Learning cooperative strategies in StarCraft through role-based monotonic value function factorization; Electronic Research Archive; 2024

2. Deep ensemble learning of tactics to control the main force in a real-time strategy game; Multimedia Tools and Applications; 2023-06-24