1. Astrom, K.J., Wittenmark, B.: Adaptive Control. Addison-Wesley, New York (1979)
2. Baird III, L.C.: Reinforcement learning in continuous time: advantage updating. In: Proceedings of the IEEE International Conference on Neural Networks, pp. 2448–2453 (1994)
3. Balaji, P.G., German, X., Srinivasan, D.: Urban traffic signal control using reinforcement learning agents. IET Intell. Transp. Sy. 4, 177–188 (2010)
4. Barto, A., Sutton, R.: Reinforcement Learning: An Introduction. MIT Press, Cambridge (1998)
5. Barto, A., Mahadevan, S.: Recent advances in hierarchical reinforcement learning. Discrete Event Dyn. Syst. 13, 343–379 (2003)