1. Christopher M (1992) Logistic and supply chain management. Pitman Publishing, London
2. Watkins CJCH, Dayan P (1992) Q-learning. Mach Learn 8(3–4):279–292
3. Lin L-J (1993) Reinforcement learning for robots using neural networks. Technical report, DTIC Document
4. Baird L (1995) Residual algorithms: reinforcement learning with function approximation. In: Machine learning: proceedings of the twelfth international conference, pp 30–37
5. Kaelbling LP, Littman ML, Moore AW (1996) Reinforcement learning: a survey. J Artif Intell Res 4:237–285