1. Fast concurrent reinforcement learners;Banerjee B.;International Joint conference on Artificial Intelligence,2001
2. The Q‐learning obstacle avoidance algorithm based on EKF‐SLAM for NAO autonomous walking under unknown environments;Wen S.;Robotics and Autonomous Systems,2015
3. Shoham Y. Powers R. andGrenager T.(2003).Multiagent reinforcement learning: a critical survey Web manuscript 2003.https://www.cc.gatech.edu/classes/AY2009/cs7641_spring/handouts/MALearning_ACriticalSurvey_2003_0516.pdf(accessed 26 May 2020).
4. Innovations in Multi-Agent Systems and Applications - 1