Bandit Based Monte-Carlo Planning-Reference-Cited by-同舟云学术

Bandit Based Monte-Carlo Planning

Published:2006 Issue: Volume: Page:282-293
ISSN:0302-9743
Container-title:Lecture Notes in Computer Science
language:
Short-container-title:

Author:

Kocsis Levente,Szepesvári Csaba

Publisher

Springer Berlin Heidelberg

Link

http://link.springer.com/content/pdf/10.1007/11871842_29.pdf

Reference14 articles.

1. Auer, P., Cesa-Bianchi, N., Fischer, P.: Finite time analysis of the multiarmed bandit problem. Machine Learning 47(2-3), 235–256 (2002)

2. Auer, P., Cesa-Bianchi, N., Freund, Y., Schapire, R.E.: The nonstochastic multiarmed bandit problem. SIAM Journal on Computing 32, 48–77 (2002)

3. Barto, A.G., Bradtke, S.J., Singh, S.P.: Real-time learning and control using asynchronous dynamic programming. Technical report 91-57, Computer Science Department, University of Massachusetts (1991)

4. Billings, D., Davidson, A., Schaeffer, J., Szafron, D.: The challenge of poker. Artificial Intelligence 134, 201–240 (2002)

5. Bouzy, B., Helmstetter, B.: Monte Carlo Go developments. In: van den Herik, H.J., Iida, H., Heinz, E.A. (eds.) Advances in Computer Games 10, pp. 159–174 (2004)

Cited by 1321 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Search for a Suboptimal Solution to the Dynamic Traveling Salesman Problem by the Monte Carlo Method;Автоматика и телемеханика;2024-12-15

2. PPB-MCTS: A novel distributed-memory parallel partial-backpropagation Monte Carlo tree search algorithm;Journal of Parallel and Distributed Computing;2024-11

3. A crossword solving system based on Monte Carlo tree search;Artificial Intelligence;2024-10

4. DrugSynthMC: An Atom-Based Generation of Drug-like Molecules with Monte Carlo Search;Journal of Chemical Information and Modeling;2024-09-09

5. Stateful black-box fuzzing for encryption protocols and its application in IPsec;Computer Networks;2024-09