1. Sutton RS, Barto AG (2018) Reinforcement learning: an introduction. MIT Press, Cambridge
2. Watkins CJ, Dayan P (1992) Q-learning. Mach Learn 8:279–292
3. Mariano CE, Morales EF (2001) DQL: a new updating strategy for reinforcement learning based on Q-learning. In: De Raedt L, Flach P (eds) Machine learning: ECML 2001. Springer Berlin Heidelberg, Berlin/Heidelberg, pp 324–335
4. Géron A (2022) Hands-on machine learning with Scikit-Learn, Keras, and TensorFlow. O’Reilly Media, Inc., Sebastopol
5. Lillicrap TP, Hunt JJ, Pritzel A, Heess N, Erez T, Tassa Y, Silver D, Wierstra D (2016) Continuous control with deep reinforcement learning. In: Proceedings of the International Conference on Learning Representations (ICLR), May 2016, pp 1–14