1. M. Akian, S. Gaubert, Policy iteration for perfect information stochastic mean payoff games with bounded first return times is strongly polynomial, 2013. ArXiv:1310.4953.
2. Infinite Dimensional Analysis: A Hitchhiker’s Guide;Aliprantis,2006
3. Constrained Markov Decision Processes;Altman,1999
4. Stochastic Optimal Control: The Discrete Time Case;Bertsekas,1978
5. Average control of Markov decision processes with Feller transition probabilities and general action spaces;Costa;J. Math. Anal. Appl.,2012