1. Achbany, Y., Fouss, F., Yen, L., Pirotte, A., Saerens, M.: Tuning continual exploration in reinforcement learning. Technical report (2005), http://www.isys.ucl.ac.be/staff/francois/Articles/Achbany2005a.pdf
2. Bazaraa, M.S., Sherali, H.D., Shetty, C.M.: Nonlinear programming: Theory and algorithms. John Wiley and Sons, Chichester (1993)
3. Bertsekas, D.P.: Neuro-dynamic programming. Athena Scientific, Belmont (1996)
4. Bertsekas, D.P.: Network optimization: continuous and discrete models. Athena Scientific, Belmont (1998)
5. Bertsekas, D.P.: Dynamic programming and optimal control. Athena sientific, Belmont (2000)