Author:
Kamalapurkar Rushikesh,Walters Patrick,Rosenfeld Joel,Dixon Warren
Publisher
Springer International Publishing
Reference140 articles.
1. Barto A, Sutton R, Anderson C (1983) Neuron-like adaptive elements that can solve difficult learning control problems. IEEE Trans Syst Man Cybern 13(5):834–846
2. Sutton R (1988) Learning to predict by the methods of temporal differences. Mach Learn 3(1):9–44
3. Werbos P (1990) A menu of designs for reinforcement learning over time. Neural Netw Control 67–95
4. Watkins C, Dayan P (1992) Q-learning. Mach Learn 8(3):279–292
5. Bellman RE (2003) Dynamic programming. Dover Publications, Inc, New York