1. Reinforcement learning and adaptive critic methods;Barto,1992
2. Neuronlike adaptive elements that can solve difficult learning control problems;Barto;IEEE Transactions on Systems, Man, and Cybernetics,1983
3. Truncating temporal differences: On the efficient implementation of TD(λ) for reinforcement learning;Cichosz;Journal of Artificial Intelligence Research,1995
4. The convergence of TD(λ) for general λ;Dayan;Machine Learning,1992
5. The Hedonistic Neuron: A Theory of Memory, Learning, and Intelligence;Klopf,1982