1. Abdolmaleki, A., Lioutikov, R., Peters, J., Lau, N., Reis, L.P., Neumann, G.: Model-based relative entropy stochastic search. In: Advances in Neural Information Processing Systems (NIPS), pp. 3537–3545 (2015)
2. Abdolmaleki, A., Springenberg, J.T., Tassa, Y., Munos, R., Heess, N., Riedmiller, M.: Maximum a posteriori policy optimisation. arXiv preprint arXiv:1806.06920 (2018)
3. Amari, S.I.: Natural gradient works efficiently in learning. Neural Comput. 10(2), 251–276 (1998)
4. Bagnell, J.A., Schneider, J.: Covariant policy search. In: International Joint Conference on Artificial Intelligence (IJCAI), pp. 1019–1024. Morgan Kaufmann Publishers Inc. (2003)
5. Boyd, S., Vandenberghe, L.: Convex Optimization. Cambridge University Press (2004)