1. A. Aubret, L. Matignon, and S. Hassas, "A survey on intrinsic motivation in reinforcement learning," arXiv preprint arXiv:1908.06976, 2019.
2. B. Baker et al., "Emergent tool use from multi-agent autocurricula," arXiv preprint arXiv:1909.07528, 2019.
3. G. Brockman et al., "OpenAI Gym," arXiv preprint arXiv:1606.01540, 2016.
4. Y. Burda, H. Edwards, D. Pathak, A. Storkey, T. Darrell, and A. A. Efros, "Large-scale study of curiosity-driven learning," arXiv preprint arXiv:1808.04355, 2018.
5. Y. Burda, H. Edwards, A. Storkey, and O. Klimov, "Exploration by random network distillation," arXiv preprint arXiv:1810.12894, 2018.