1. Araya-López, M., Thomas, V., Buffet, O.: Near-optimal BRL using optimistic local transitions. In: ICML'12: Proceedings of the 29th International Conference on Machine Learning, Omnipress, Edinburgh, Scotland, pp 97–104 (2012)
2. Asiain, E., Clempner, J.B., Poznyak, A.S.: Controller exploitation-exploration: A reinforcement learning architecture. Soft Computing 23(11), 3591–3604 (2019)
3. Asmuth, J., Li, L., Littman, M., Nouri, A., Wingate, D.: A Bayesian sampling approach to exploration in reinforcement learning. In: UAI '09: Proceedings of the Twenty-Fifth Conference on Uncertainty in Artificial Intelligence, AUAI Press, Montreal, Quebec, Canada, pp 19–26 (2009)
4. Bellman, R.: Adaptive Control Processes: A Guided Tour. Princeton University Press (1961)
5. Besson, R., Le Pennec, E., Allassonnière, S.: Learning from both experts and data. Entropy 21(12), 1208 (2019). https://doi.org/10.3390/e21121208