1. Lenient frequency adjusted Q-learning;Bloembergen,2010
2. Controller design for quadrotor UAVs using reinforcement learning;Bou-Ammar,2010
3. Multiagent learning using a variable learning rate;Bowling;Artif. Intell.,2002
4. R-max – a general polynomial time algorithm for near-optimal reinforcement learning;Brafman;J. Mach. Learn. Res.,2003
5. A comprehensive survey of multiagent reinforcement learning;Busoniu;IEEE Trans. Syst. Man Cybern., Part C, Appl. Rev.,2008