1. A comparison study of cooperative Q-learning algorithms for independent learners;Abed-Alguni;Int. J. Artif. Intell.,2016
2. Adaptive Control;Åström,2013
3. Fractals Everywhere;Barnsley,2014
4. Successor features for transfer in reinforcement learning;Barreto,2017
5. Y. Bengio, N. Léonard, A. Courville, Estimating or propagating gradients through stochastic neurons for conditional computation, arXiv preprint arXiv:1308.3432 (2013).