Publisher
Springer International Publishing
References: 38 articles.
1. Amari, S.I.: Natural gradient works efficiently in learning. Neural Comput. 10(2), 251–276 (1998)
2. Amato, C., Dibangoye, J.S., Zilberstein, S.: Incremental policy generation for finite-horizon DEC-POMDPs. In: Proceedings of the Nineteenth International Conference on Automated Planning and Scheduling (2009)
3. Åström, K.J.: Optimal control of Markov processes with incomplete state information. J. Math. Anal. Appl. 10, 174–205 (1965)
4. Bellman, R.E.: The theory of dynamic programming. Bull. Am. Math. Soc. 60(6), 503–515 (1954)
5. Bernstein, D.S., Givan, R., Immerman, N., Zilberstein, S.: The complexity of decentralized control of Markov decision processes. Math. Oper. Res. 27(4), 819–840 (2002)
Cited by
9 articles.