1. D. Abel, Y. Jinna, Y. Guo, G. Konidaris and M.L. Littman, Policy and value transfer in lifelong reinforcement learning, in: Proceedings of the International Conference on Machine Learning (ICML 2018), Stockholm, Sweden, 2018, pp. 1–10.
2. M. Andrychowicz, M. Denil, S.G. Colmenarejo and M.W. Hoffman, Learning to learn by gradient descent by gradient descent, in: Proceedings of the Conference on Neural Information Processing Systems (NeurIPS 2016), 2016, pp. 1–17.
3. Finite-time analysis of the multiarmed bandit problem;Auer;Machine Learning,2002
4. The capacity of feedforward neural networks;Baldi;Neural Networks,2019
5. J. Bieger, K.R. Thorisson, B.R. Steunebrink, T. Thorarensen and J.S. Sigurdardottir, Evaluation of general-purpose artificial intelligence: Why, what & how, in: Evaluating General-Purpose A.I. Workshop in the European Conference on Artificial Intelligence (ECAI 2016), The Hague, The Netherlands, 2016.