1. K. Hinderer, On approximate solutions of finite-stage dynamic programs, Dynamic Programming and its Applications, ed. M.L. Puterman (Academic Press, New York, 1978).
2. M. Kolonko, Bounds for the regret loss in dynamic programming under adaptive control, Zeit. O.R. 27 (1983) 17–37.
3. M. Kurano, Average-optimal adaptive policies in semi-Markov decision processes including an unknown parameter, J. Oper. Res. Soc. Japan 28 (1985) 252–266.
4. H.J. Langen, Convergence of dynamic programming models, Math. Oper. Res. 6 (1981) 493–512.
5. C.D. Meyer Jr., The condition of a finite Markov chain and perturbation bounds for the limiting probabilities, SIAM J. Alg. Disc. Meth. 1 (1980) 273–283.