1. Alexander, S.A., Castaneda, M., Compher, K., Martinez, O.: Extending environments to measure self-reflection in reinforcement learning. J. Artif. Gen. Intell. 13(1), 1–24 (2022)
2. Alexander, S.A., Quarel, D., Du, L., Hutter, M.: Universal agent mixtures and the geometry of intelligence. In: AISTATS, PMLR (2023)
3. Bell, J., Linsefors, L., Oesterheld, C., Skalse, J.: Reinforcement learning in Newcomblike environments. In: NeurIPS (2021)
4. Hutter, M.: Universal Artificial Intelligence: sequential Decisions Based on Algorithmic Probability. Springer (2004)
5. Hutter, M.: Discrete MDL predicts in total variation. In: Advances in Neural Information Processing Systems, vol. 22 (2009)