1. Dynamic programming and stochastic control; Bertsekas, 1976
2. Reinforcement learning applied to linear quadratic regulation; Bradtke, 1993
3. A teaching method for reinforcement learning; Clouse, 1992
4. Credit assignment in rule discovery systems based on genetic algorithms; Grefenstette; Mach. Learn., 1988
5. Cognitive systems based on adaptive algorithms; Holland, 1987