1. Abdolmaleki, A., Springenberg, J.T., Tassa, Y., Munos, R., Heess, N., Riedmiller, M.: Maximum a posteriori policy optimisation. In: International Conference on Learning Representations (2018). https://openreview.net/forum?id=S1ANxQW0b
2. Attias, H.: Planning by probabilistic inference. In: Bishop, C.M., Frey, B.J. (eds.) Proceedings of the Ninth International Workshop on Artificial Intelligence and Statistics. Proceedings of Machine Learning Research, R4, pp. 9–16 (2003). https://proceedings.mlr.press/r4/attias03a.html
3. Axelrod, R., Hamilton, W.D.: The evolution of cooperation. Science 211(4489), 1390–1396 (1981). https://doi.org/10.1126/science.7466396
4. Blei, D.M., Kucukelbir, A., McAuliffe, J.D.: Variational inference: a review for statisticians. J. Am. Stat. Assoc. 112(518), 859–877 (2017). https://doi.org/10.1080/01621459.2017.1285773
5. Botvinick, M., Toussaint, M.: Planning as inference. Trends Cogn. Sci. 16(10), 485–488 (2012). https://doi.org/10.1016/j.tics.2012.08.006