1. Discrete-time controlled Markov processes with average cost criterion: A survey;Arapostathis;SIAM Journal on Control and Optimization,1993
2. Arts, J., Van Vuuren, M., & Kiesmuller, G. (2009). Efficient optimization of the dual-index policy using Markov chains. Tech. rep., Technische Universiteit Eindhoven.
3. Set-valued analysis;Aubin,1999
4. Neuro-dynamic programming;Bertsekas,1996