1. E. Todorov , T. Erez , and Y. Tassa , " MuJoCo: A physics engine for model-based control," in 2012 IEEE/RSJ International Conference on Intelligent Robots and Systems , pp. 5026 -- 5033 , 2012 . E. Todorov, T. Erez, and Y. Tassa, "MuJoCo: A physics engine for model-based control," in 2012 IEEE/RSJ International Conference on Intelligent Robots and Systems, pp. 5026--5033, 2012.
2. G. Brockman , V. Cheung , L. Pettersson , J. Schneider , J. Schulman , J. Tang , and W. Zaremba , " Openai gym," arXiv preprint arXiv:1606.01540 , 2016 . G. Brockman, V. Cheung, L. Pettersson, J. Schneider, J. Schulman, J. Tang, and W. Zaremba, "Openai gym," arXiv preprint arXiv:1606.01540, 2016.
3. H.-J. Kim and Y.-H. Kim "A Surrogate model using deep neural networks for optimal oil skimmer assignment " in Proceedings of the Genetic and Evolutionary Computation Conference Companion pp. 39--40 2020. H.-J. Kim and Y.-H. Kim "A Surrogate model using deep neural networks for optimal oil skimmer assignment " in Proceedings of the Genetic and Evolutionary Computation Conference Companion pp. 39--40 2020.
4. A. Plaat , W. Kosters , and M. Preuss , " Deep model-based reinforcement learning for high-dimensional problems, a Survey," arXiv preprint arXiv:2008.05598 , 2020 . A. Plaat, W. Kosters, and M. Preuss, "Deep model-based reinforcement learning for high-dimensional problems, a Survey," arXiv preprint arXiv:2008.05598, 2020.
5. S. Kumar "Balancing a Cartpole system with reinforcement learning-A tutorial " arXiv preprint arXiv:2006.04938 2020. S. Kumar "Balancing a Cartpole system with reinforcement learning-A tutorial " arXiv preprint arXiv:2006.04938 2020.