Subject
Applied Mathematics,Industrial and Manufacturing Engineering,Management Science and Operations Research,Software
Reference7 articles.
1. On the empirical state-action frequencies in Markov decision processes under general policies;Mannor;Mathematics of Operations Research,2005
2. L. Alfaro, Formal verification of probabilistic systems, Ph.D. Thesis, Stanford University, Stanford, CA, USA, 1997
3. Hoeffding’s inequality for uniformly ergodic Markov chains;Glynn;Statistics and Probability Letters,2002
4. Rate of convergence of empirical measures and costs in controlled Markov chains and transient optimality;Altman;Mathematics of Operations Research,1994
5. Denumerable Markov Chains;Kemeny,1976
Cited by
4 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献