On Monte-Carlo tree search for deterministic games with alternate moves and complete information

Published: 2019, Volume: 23, Pages: 176-216
ISSN: 1262-3318
Container-title: ESAIM: Probability and Statistics
Short-container-title: ESAIM: PS

Authors:

Sylvain Delattre, Nicolas Fournier

Abstract

We consider a deterministic game with alternate moves and complete information, whose outcome is always the victory of one of the two opponents. We assume that this game is the realization of a random model enjoying some independence properties. We consider algorithms in the spirit of Monte-Carlo Tree Search to best estimate the minimax value of a given position: such an algorithm successively simulates n well-chosen matches starting from this position. We build an algorithm that is step-by-step optimal in the following sense: once the first n matches are simulated, the algorithm decides, from the statistics furnished by these n matches (and the a priori we have on the game), how to simulate the (n + 1)th match in such a way that the increase of information concerning the minimax value of the position under study is maximal. This algorithm is remarkably quick. We prove that our step-by-step optimal algorithm is not globally optimal and that it always converges in a finite number of steps, even if the a priori we have on the game is completely irrelevant. We finally test our algorithm against MCTS on Pearl’s game [Pearl, Artif. Intell. 14 (1980) 113–138] and, with a very simple and universal a priori, on the game Connect Four and some variants. The numerical results are rather disappointing. We however exhibit some situations in which our algorithm seems efficient.
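
To make the notion of "minimax value" in the abstract concrete, here is a minimal sketch in Python. It assumes the usual description of Pearl's game as a uniform tree whose leaves are independent win/loss draws with probability p; that reading is an assumption and is not taken from this page. The function names `minimax_value` and `root_win_probability` are hypothetical. This is plain exhaustive minimax on a random tree, not the authors' step-by-step optimal algorithm and not MCTS.

```python
import random


def minimax_value(depth, branching, p, rng, maximizing=True):
    """Exact minimax value (1 = root player wins) of a random uniform tree.

    Assumption: leaves are i.i.d. Bernoulli(p), drawn lazily during the walk.
    """
    if depth == 0:
        return 1 if rng.random() < p else 0
    children = [
        minimax_value(depth - 1, branching, p, rng, not maximizing)
        for _ in range(branching)
    ]
    # The root player needs one winning child; the opponent needs one losing child.
    return max(children) if maximizing else min(children)


def root_win_probability(depth, branching, p, maximizing=True):
    """Probability that the root player wins, by backward induction on depth."""
    if depth == 0:
        return p
    q = root_win_probability(depth - 1, branching, p, not maximizing)
    if maximizing:
        return 1 - (1 - q) ** branching  # at least one winning child
    return q ** branching  # every child must be winning


if __name__ == "__main__":
    rng = random.Random(0)
    samples = [minimax_value(4, 2, 0.5, rng) for _ in range(10_000)]
    print("empirical:", sum(samples) / len(samples))
    print("exact:    ", root_win_probability(4, 2, 0.5))
```

Exhaustive evaluation visits every leaf, so its cost grows exponentially with depth. The algorithms discussed in the abstract instead estimate the minimax value from n simulated matches.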

Publisher

EDP Sciences

Subject

Statistics and Probability

Link

https://www.esaim-ps.org/10.1051/ps/2018006/pdf

Cited by 1 article.

1. Analysis on the Procurement Cost of Construction Supply Chain based on Evolutionary Game Theory. Arabian Journal for Science and Engineering, 2021-01-06.