1. Markov Decision Processes: Discrete Stochastic Dynamic Programming;Puterman,1994
2. Handbook of Markov Decision Processes: Methods and Applications;Feinberg,2002
3. Neuro-Dynamic Programming;Bertsekas,1996
4. J.C. Hennet, A graph formulation of some supervisory control problems, in: IEEE International Conference on Systems, Man and Cybernetics, Le Touquet (France), 17–20 October 1993, pp. 601–606.
5. On average reward semi-Markov decision processes with a general multichain structure;Liu;Mathematics of Operations Research,2004