Upper Confidence Trees with Short Term Partial Information-Reference-Cited by-同舟云学术

Upper Confidence Trees with Short Term Partial Information

Published:2011 Issue: Volume: Page:153-162
ISSN:0302-9743
Container-title:Applications of Evolutionary Computation
language:
Short-container-title:

Author:

Teytaud Olivier,Flory Sébastien

Publisher

Springer Berlin Heidelberg

Link

http://link.springer.com/content/pdf/10.1007/978-3-642-20525-5_16

Reference14 articles.

1. Audibert, J.-Y., Bubeck, S.: Minimax policies for adversarial and stochastic bandits. In: Proceedings of the Annual Conference on Learning Theory (COLT) (2009)

2. Auer, P., Cesa-Bianchi, N., Freund, Y., Schapire, R.E.: Gambling in a rigged casino: the adversarial multi-armed bandit problem. In: Proceedings of the 36th Annual Symposium on Foundations of Computer Science, pp. 322–331. IEEE Computer Society Press, Los Alamitos (1995)

3. Bouzy, B., Métivier, M.: Multi-agent learning experiments on repeated matrix games. In: ICML, pp. 119–126 (2010)

4. Lecture Notes in Computer Science;R. Coulom,2007

5. Grigoriadis, M.D., Khachiyan, L.G.: A sublinear-time randomized approximation algorithm for matrix games. Operations Research Letters 18(2), 53–58 (1995)

Cited by 14 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Decoupled Monte Carlo Tree Search for Cooperative Multi-Agent Planning;Applied Sciences;2023-02-02

2. An Efficient Dynamic Sampling Policy for Monte Carlo Tree Search;2022 Winter Simulation Conference (WSC);2022-12-11

3. Analysis of Hannan consistent selection for Monte Carlo tree search in simultaneous move games;Machine Learning;2019-07-25

4. No Free Lunch Theorem: A Review;Approximation and Optimization;2019

5. HoningStone: Building Creative Combos With Honing Theory for a Digital Card Game;IEEE Transactions on Computational Intelligence and AI in Games;2017-06