1. Abdolmaleki, A., Springenberg, J., Tassa, Y., Munos, R., Heess, N., Riedmiller, M.: Maximum a posteriori policy optimisation. In: International Conference on Learning Representations (2018). https://openreview.net/forum?id=S1ANxQW0b
2. Attias, H.: Planning by probabilistic inference. In: International Workshop on Artificial Intelligence and Statistics, pp. 9–16. PMLR (2003)
3. Bishop, C.M., Nasrabadi, N.M.: Pattern Recognition and Machine Learning, vol. 4, no. 4, p. 738. Springer, New York (2006)
4. Da Costa, L., Parr, T., Sajid, N., Veselic, S., Neacsu, V., Friston, K.: Active inference on discrete state-spaces: a synthesis. J. Math. Psychol. 99, 102447 (2020)
5. Da Costa, L., Sajid, N., Parr, T., Friston, K., Smith, R.: Reward maximization through discrete active inference. Neural Comput. 35(5), 807–852 (2023). https://doi.org/10.1162/neco_a_01574