Limits and limitations of no-regret learning in games-Reference-Cited by-同舟云学术

Limits and limitations of no-regret learning in games

Published:2017 Issue: Volume:32 Page:
ISSN:0269-8889
Container-title:The Knowledge Engineering Review
language:en
Short-container-title:The Knowledge Engineering Review

Author:

Monnot Barnabé,Piliouras Georgios

Abstract

AbstractWe study the limit behavior and performance of no-regret dynamics in general game theoretic settings. We design protocols that achieve both good regret and equilibration guarantees in general games. We also establish a strong equivalence between them and coarse correlated equilibria (CCE). We examine structured game settings where stronger properties can be established for no-regret dynamics and CCE. In congestion games with non-atomic agents (each contributing a fraction of the flow), as we decrease the individual flow of agents, CCE become closely concentrated around the unique equilibrium flow of the non-atomic game. Moreover, we compare best/worst case no-regret learning behavior to best/worst case Nash equilibrium (NE) in small games. We prove analytical bounds on these inefficiency ratios for 2×2 games and unboundedness for larger games. Experimentally, we sample normal form games and compute their measures of inefficiency. We show that the ratio distribution has sharp decay, in the sense that most generated games have small ratios. They also exhibit strong anti-correlation between each other, that is games with large improvements from the best NE to the best CCE present small degradation from the worst NE to the worst CCE.

Publisher

Cambridge University Press (CUP)

Subject

Artificial Intelligence,Software

Reference27 articles.

1. Greenwald A. & Jafari A. 2003. A general class of no-regret learning algorithms and game-theoretic equilibria. In Learning Theory and Kernel Machines, 2–12. Springer.

2. Koutsoupias E. & Papadimitriou C. H. 1999. Worst-case equilibria. In STACS, 404–413.

3. Blum A. , Hajiaghayi M. , Ligett K. & Roth A. 2008. Regret minimization and the price of total anarchy. In Proceedings of the Fortieth Annual ACM Symposium on Theory of Computing, 373–382. ACM.

4. If multi-agent learning is the answer, what is the question?

5. Hart S. & Mansour Y. 2007. The communication complexity of uncoupled Nash equilibrium procedures. In Proceedings of the Thirty-Ninth Annual ACM Symposium on Theory of Computing, 345–353. ACM.

Cited by 3 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Auctions between Regret-Minimizing Agents;Proceedings of the ACM Web Conference 2022;2022-04-25

2. On modeling blockchain-enabled economic networks as stochastic dynamical systems;Applied Network Science;2020-03-19

3. Routing Games in the Wild: Efficiency, Equilibration and Regret;Web and Internet Economics;2017