Posterior sampling for Monte Carlo planning under uncertainty

Published:2018-08-15 Issue:12 Volume:48 Page:4998-5018
ISSN:0924-669X
Container-title:Applied Intelligence
language:en
Short-container-title:Appl Intell

Author:

Bai Aijun, Wu Feng, Chen Xiaoping

Funder

National Research Foundation for the Doctoral Program of China

National Hi-Tech Project of China

National Natural Science Foundation of China (CN)

Publisher

Springer Science and Business Media LLC

Subject

Artificial Intelligence

Link

http://link.springer.com/article/10.1007/s10489-018-1248-5/fulltext.html

Reference: 83 articles.

1. Agrawal S, Goyal N (2012) Analysis of Thompson sampling for the multi-armed bandit problem. In: Conference on learning theory, pp 39.1–39.26

2. Agrawal S, Goyal N (2013) Further optimal regret bounds for Thompson sampling. In: Artificial intelligence and statistics, pp 99–107

3. Anand A, Grover A, Mausam, Singla P (2015) ASAP-UCT: Abstraction of state-action pairs in UCT. In: Yang Q, Wooldridge M (eds) IJCAI. AAAI Press, pp 1509–1515

4. Anand A, Noothigattu R, Mausam, Singla P (2016) OGA-UCT: On-the-go abstractions in UCT. In: Coles AJ, Coles A, Edelkamp S, Magazzeni D, Sanner S (eds) ICAPS. AAAI Press, pp 29–37

5. Asmuth J, Littman ML (2011) Learning is planning: near Bayes-optimal reinforcement learning via Monte-Carlo tree search. In: Uncertainty in artificial intelligence, pp 19–26

Cited by 2 articles.

1. Methodology for the projection of population pyramids based on Monte Carlo simulation and genetic algorithms;Applied Intelligence;2023-02-16

2. Navigating Uncertainty: A Consensus-Based Algorithm for Solving the Stochastic Canadian Traveler Problem;2023