1. A survey of applications of Markov decision processes;White;The Journal of the Operational Research Society,1993
2. Neuro-dynamic programming;Bertsekas,1996
3. Reinforcement learning: a survey;Kaelbling;Journal of Artificial Intelligence Research,1996
4. Reinforcement learning: an introduction;Sutton,1998
5. Simulation-based optimization: parametric optimization techniques and reinforcement learning;Gosavi,2003