1. Reinforcement Learning and Dynamic Programming Using Function Approximators;Busoniu,2010
2. B. Banerjee, S. Sen, J. Peng, Fast concurrent reinforcement learners, in: International Joint Conference on Artificial Intelligence, vol. 17, no. 1, Seattle, Washington, USA, 2001, pp. 825–832
3. The Q-learning obstacle avoidance algorithm based on EKF-SLAM for NAO autonomous walking under unknown environments;Wen;Robot. Auton. Syst.,2015
4. Y. Shoham, R. Powers and T. Grenager, Multiagent Reinforcement Learning: A Critical Survey, Web manuscript, 2003.
5. Innovations in Multi-agent Systems and Applications-1,2010