1. Sutton, R., Barto, A.: “Reinforcement Learning: An Introduction.” Cambridge, MA: MIT Press (1998).
2. Barto, A.: “Adaptive critics and the basal ganglia.” In: Models of Information Processing in the Basal Ganglia, pp. 215-232. Cambridge, MA: MIT Press (1995).
3. Suri, R., Schultz, W.: “A neural network model with dopamine-like reinforcement signal that learns a spatial delayed response task.” In: Neuroscience 91(3):871-890 (1999).
4. Suri, R., Schultz, W.: “Temporal difference model reproduces anticipatory neural activity.” In: Neural Computation 13:841-862 (2001).
5. Chrisman, L.: “Reinforcement learning with perceptual aliasing: The perceptual distinctions approach.” In: Proc. Int’l. Conf. on AAAI, pp. 183-188 (1992).