1. Morales, M.: Grokking deep reinforcement learning. Manning Publications, New York (2020)
2. Kurinov, I., Orzechowski, G., Hämäläinen, P., Mikkola, A.: Automated excavator based on reinforcement learning and multibody system dynamics. IEEE Access 8, 213998–214006 (2020)
3. Luo, F., Xu, T., Lai, H., Chen, X., Zhang, W., Yu, Y.: A survey on model-based reinforcement learning. arXiv preprint arXiv:2206.09328 (2022)
4. Xiao, C., Wu, Y., Ma, C., Schuurmans, D., Müller, M.: Learning to combat compounding-error in model-based reinforcement learning. arXiv preprint arXiv:1912.11206 (2019)
5. Puterman, M.: Markov decision processes: discrete stochastic dynamic programming. Wiley, New York (2013)