DeltaDou: Expert-level Doudizhu AI through Self-play

Published: 2019-08
Container-title: Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence

Author:

Jiang Qiqi¹, Li Kuangzheng¹, Du Boyao¹, Chen Hao¹, Fang Hai¹

Affiliation:

1. SweetCode Inc, Beijing

Abstract

Artificial intelligence has seen several breakthroughs in two-player perfect-information games. Nevertheless, Doudizhu, a three-player imperfect-information game, remains quite challenging. In this paper, we present a Doudizhu AI built by applying deep reinforcement learning from games of self-play. The algorithm combines an asymmetric MCTS over the information-set nodes of each player, a policy-value network that approximates the policy and value at each decision node, and inference of the other players' unobserved hands under a given policy. Our results show that self-play can significantly improve the performance of our agent in this multi-agent imperfect-information game. Even starting from a weak AI, our agent can reach human expert level after days of self-play and training.

Publisher

International Joint Conferences on Artificial Intelligence Organization

Cited by 26 articles.

1. Perfect Information Monte Carlo with Postponing Reasoning; 2024 IEEE Conference on Games (CoG); 2024-08-05

2. Learning in games: a systematic review; Science China Information Sciences; 2024-06-28

3. Alternate inference-decision reinforcement learning with generative adversarial inferring for bridge bidding; Neural Computing and Applications; 2024-05-22

4. Tjong: A transformer-based Mahjong AI via hierarchical decision-making and fan backward; CAAI Transactions on Intelligence Technology; 2024-03-21

5. A hierarchical branch and bound algorithm for Mahjong deficiency; Soft Computing; 2023-12-07