1. Furfaro R, Bloise I, Orlandelli M, Di Lizia P, Topputo F, Linares R (2018) Deep learning for autonomous lunar landing. Adv Astronaut Sci 167:3285–3306
2. Brockman G, Cheung V, Pettersson L, Schneider J, Schulman J, Tang J, Zaremba W (2016) Openai gym. arXiv preprint arXiv:1606.01540
3. Levine S, Kumar A, Tucker G, Fu J (2020) Offline reinforcement learning: tutorial, review, and perspectives on open problems. arXiv preprint arXiv:2005.01643
4. Jang B, Kim M, Harerimana G, Kim JW (2019) Q-learning algorithms: a comprehensive classification and applications. IEEE access 7:133653–133667
5. Watkins CJ, Dayan P (1992) Q-learning. Machine learning 8:279–292