1. Sutton, R., Barto, A.: Reinforcement learning. MIT Press, Cambridge (1998)
2. Togelius, J., Schaul, T., Wierstra, D., Igel, C., Gomez, F., Schmidhuber, J.: Ontogenetic and phylogenetic reinforcement learning. ZeitschriftK unstlicheIntelligenz 3, 30–33 (2009)
3. Watkins, C.J.: Learning with Delayed Rewards. PhD thesis, Psychology Department, University of Cambridge, UK (1989)
4. Lecture Notes in Computer Science;J. Dowling,2005
5. Williams, R.J.: Simple statistical gradient-following algorithms for connectionist reinforcement learning. Machine Learning 8(3), 229–256 (1992)