Improving the speed of convergence of multi-agent Q-learning for cooperative task-planning by a robot-team

Published: 2017-06 Volume: 92 Page: 66-80
ISSN: 0921-8890
Container-title: Robotics and Autonomous Systems
Language: en

Author:

Sadhu Arup Kumar, Konar Amit

Funder

Council of Scientific and Industrial Research (CSIR)

UGC

Publisher

Elsevier BV

Subject

Computer Science Applications, General Mathematics, Software, Control and Systems Engineering

References: 66 articles.

1. Reinforcement Learning and Dynamic Programming Using Function Approximators; Busoniu, 2010

2. B. Banerjee, S. Sen, J. Peng, Fast concurrent reinforcement learners, in: International Joint Conference on Artificial Intelligence, vol. 17, no. 1, Seattle, Washington, USA, 2001, pp. 825–832

3. The Q-learning obstacle avoidance algorithm based on EKF-SLAM for NAO autonomous walking under unknown environments; Wen; Robot. Auton. Syst., 2015

4. Y. Shoham, R. Powers and T. Grenager, Multiagent Reinforcement Learning: A Critical Survey, Web manuscript, 2003.

5. Innovations in Multi-agent Systems and Applications-1, 2010

Cited by 23 articles.

1. Correlated Equilibrium based Online Real-time Distributed Dynamic Task Scheduler for Multi-agent Systems; 2024 International Joint Conference on Neural Networks (IJCNN); 2024-06-30

2. RevAP: A bankruptcy-based algorithm to solve the multi-agent credit assignment problem in task start threshold-based multi-agent systems; Robotics and Autonomous Systems; 2024-04

3. Drone Swarm Coordination Using Reinforcement Learning for Efficient Wildfires Fighting; SN Computer Science; 2024-03-13

4. Coupling Effect of Exploration Rate and Learning Rate for Optimized Scaled Reinforcement Learning; SN Computer Science; 2023-08-25

5. Self-improving Q-learning based controller for a class of dynamical processes; Archives of Control Sciences; 2023-07-26