Cooperative Multi-agent Policy Gradient-Reference-Cited by-同舟云学术

Cooperative Multi-agent Policy Gradient

Published:2019 Issue: Volume: Page:459-476
ISSN:0302-9743
Container-title:Machine Learning and Knowledge Discovery in Databases
language:
Short-container-title:

Author:

Bono Guillaume^ORCID,Dibangoye Jilles Steeve,Matignon Laëtitia,Pereyron Florian,Simonin Olivier

Publisher

Springer International Publishing

Link

http://link.springer.com/content/pdf/10.1007/978-3-030-10925-7_28

Reference38 articles.

1. Amari, S.I.: Natural gradient works efficiently in learning. Neural Comput. 10(2), 251–276 (1998)

2. Amato, C., Dibangoye, J.S., Zilberstein, S.: Incremental policy generation for finite-horizon DEC-POMDPs. In: Proceedings of the Nineteenth International Conference on Automated Planning and Scheduling (2009)

3. Aström, K.J.: Optimal control of Markov decision processes with incomplete state estimation. J. Math. Anal. Appl. 10, 174–205 (1965)

4. Bellman, R.E.: The Theory of dynamic programming. Bull. Am. Math. Soc. 60(6), 503–515 (1954)

5. Bernstein, D.S., Givan, R., Immerman, N., Zilberstein, S.: The complexity of decentralized control of Markov decision processes. Math. Oper. Res. 27(4), 819–840 (2002)

Cited by 9 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Mission Planning for Multiple Autonomous Underwater Vehicles with Constrained In Situ Recharging;2024 IEEE International Conference on Robotics and Automation (ICRA);2024-05-13

2. GHQ: grouped hybrid Q-learning for cooperative heterogeneous multi-agent reinforcement learning;Complex & Intelligent Systems;2024-04-23

3. UAV-Based Warehouse Management Using Multi-Agent RL;Advances in Computational Intelligence and Robotics;2024-01-17

4. Multi-agent learning via gradient ascent activity-based credit assignment;Scientific Reports;2023-09-14

5. HSVI Can Solve Zero-Sum Partially Observable Stochastic Games;Dynamic Games and Applications;2023-09-02