1. Åström KJ, Wittenmark B (1995) Adaptive control, 2nd edn. Addison-Wesley, Reading
2. Ammar HB, Tuyls K, Taylor ME, Driessens K, Weiss G (2012) Reinforcement learning transfer via sparse coding. In: Proceedings of the 11th international conference on autonomous agents and multiagent systems, vol 1. International Foundation for Autonomous Agents and Multiagent Systems, pp 383–390
3. Ammar HB, Eaton E, Ruvolo P, Taylor ME (2015) Unsupervised cross-domain transfer in policy gradient reinforcement learning via manifold alignment. In: Proceedings of the AAAI
4. Axelrod A, Chowdhary G (2015) The explore-exploit dilemma in nonstationary decision making under uncertainty. In: The explore-exploit dilemma in nonstationary decision making under uncertainty, ser 2198–4182, 1st edn. Springer International Publishing. https://www.springerprofessional.de/en/the-explore-exploit-dilemma-in-nonstationary-decision-making-und/7454158
5. Banerjee B, Stone P (2007) General game learning using knowledge transfer. In: IJCAI, pp 672–677