AWESOME: A general multiagent learning algorithm that converges in self-play and learns a best response against stationary opponents

Published: 2006-09-18 Issue: 1-2 Volume: 67 Page: 23-43
ISSN: 0885-6125
Container-title: Machine Learning
Language: en
Short-container-title: Mach Learn

Author:

Conitzer Vincent, Sandholm Tuomas

Publisher

Springer Science and Business Media LLC

Subject

Artificial Intelligence, Software

Link

http://link.springer.com/content/pdf/10.1007/s10994-006-0143-1.pdf

References: 58 articles.

1. Auer, P., Cesa-Bianchi, N., Freund, Y., & Schapire, R. E. (1995). Gambling in a rigged casino: The adversarial multi-armed bandit problem. In Proceedings of the Annual Symposium on Foundations of Computer Science (FOCS) (pp. 322–331).

2. Aumann, R. (1974). Subjectivity and correlation in randomized strategies. Journal of Mathematical Economics, 1, 67–96.

3. Banerjee, B., & Peng, J. (2004). Performance bounded reinforcement learning in strategic interactions. In Proceedings of the National Conference on Artificial Intelligence (AAAI) (pp. 2–7). San Jose, CA, USA.

4. Banerjee, B., Sen, S., & Peng, J. (2001). Fast concurrent reinforcement learners. In Proceedings of the Seventeenth International Joint Conference on Artificial Intelligence (IJCAI) (pp. 825–830). Seattle, WA.

5. Bowling, M. (2005). Convergence and no-regret in multiagent learning. In Proceedings of the Annual Conference on Neural Information Processing Systems (NIPS) (pp. 209–216). Vancouver, Canada.

Cited by 69 articles.

1. The Online Saddle Point Problem and Online Convex Optimization with Knapsacks;Mathematics of Operations Research;2024-01-12

2. Expected Lenient Q-learning: a fast variant of the Lenient Q-learning algorithm for cooperative stochastic Markov games;International Journal of Machine Learning and Cybernetics;2024-01-09

3. Smooth Q-Learning: An Algorithm for Independent Learners in Stochastic Cooperative Markov Games;Journal of Intelligent & Robotic Systems;2023-07-18

4. Online Markov decision processes with non-oblivious strategic adversary;Autonomous Agents and Multi-Agent Systems;2023-01-27

5. Modeling opponent learning in multiagent repeated games;Applied Intelligence;2022-12-23