1. Sutton RS, Barto AG (2018) Reinforcement learning: an introduction. MIT press, New York
2. Li Y (2017) Deep reinforcement learning: an overview. arXiv preprint arXiv:1701.07274
3. Watkins CJ, Dayan P (1992) Q-learning. Mach Learn 8(3–4):279–292
4. Nair A, Srinivasan P, Blackwell S, Alcicek C, Fearon R, De Maria A, Panneershelvam V, Suleyman M, Beattie C, Petersen S et al (2015) Massively parallel methods for deep reinforcement learning. arXiv preprint arXiv:1507.04296
5. Liu X, Diao J, Li N (2022) A FPGA-based accelerator implementation for path planning using Q_learning algorithm. J Phys Conf Ser 2245(1):5475