1. Abdolmaleki, A., Springenberg, J.T., Tassa, Y., Munos, R., Heess, N., and Riedmiller, M. (2018). Maximum a Posteriori Policy Optimisation. arXiv preprint arXiv:1806.06920.
2. A general class of coefficients of divergence of one distribution from another;Ali;Journal of the Royal Statistical Society: Series B (Methodological),1966
3. Amodei, D., Olah, C., Steinhardt, J., Christiano, P., Schul-man, J., and Mané, D. (2016). Concrete problems in ai safety. arXiv preprint arXiv:1606.06565.
4. Coherent measures of risk;Artzner;Mathematical fnance,1999
5. Robust and Risk-sensitive Output Feedback Control for Finite State Machines and Hidden Markov Models;Baras;Journal of Mathematical Systems, Estimation, and Control,1997