Computational Performance of Deep Reinforcement Learning to Find Nash Equilibria-Reference-Cited by-同舟云学术

Computational Performance of Deep Reinforcement Learning to Find Nash Equilibria

Published:2023-01-03 Issue:2 Volume:63 Page:529-576
ISSN:0927-7099
Container-title:Computational Economics
language:en
Short-container-title:Comput Econ

Author:

Graf Christoph,Zobernig Viktor,Schmidt Johannes,Klöckl Claude^ORCID

Abstract

AbstractWe test the performance of deep deterministic policy gradient—a deep reinforcement learning algorithm, able to handle continuous state and action spaces—to find Nash equilibria in a setting where firms compete in offer prices through a uniform price auction. These algorithms are typically considered “model-free” although a large set of parameters is utilized by the algorithm. These parameters may include learning rates, memory buffers, state space dimensioning, normalizations, or noise decay rates, and the purpose of this work is to systematically test the effect of these parameter configurations on convergence to the analytically derived Bertrand equilibrium. We find parameter choices that can reach convergence rates of up to 99%. We show that the algorithm also converges in more complex settings with multiple players and different cost structures. Its reliable convergence may make the method a useful tool to studying strategic behavior of firms even in more complex settings.

Funder

Oesterreichische Nationalbank

Austrian Science Fund

H2020 European Research Council

Publisher

Springer Science and Business Media LLC

Subject

Computer Science Applications,Economics, Econometrics and Finance (miscellaneous)

Link

https://link.springer.com/content/pdf/10.1007/s10614-022-10351-6.pdf

Reference60 articles.

1. Adami, C., Schossau, J., & Hintze, A. (2016). Evolutionary game theory using agent-based methods. Physics of Life Reviews, 19, 1–26. https://doi.org/10.1016/j.plrev.2016.08.015.

2. Aliabadi, D. E., Kaya, M., & Şahin, G. (2017). An agent-based simulation of power generation company behavior in electricity markets under different market-clearing mechanisms. Energy Policy, 100, 191–205. https://doi.org/10.1016/j.enpol.2016.09.063.

3. Andreoni, J., & Miller, J. H. (1995). Auctions with artificial adaptive agents. Games and Economic Behavior, 10(1), 39–64.

4. Asker, J., Fershtman, C., & Pakes, A. (2021). Artificial intelligence and pricing: The impact of algorithm design. Technical report, National Bureau of Economic Research. https://www.nber.org/system/files/working_papers/w28535/w28535.pdf

5. Awerbuch, B., Azar, Y., Epstein, A., Mirrokni, V. S., & Skopalik, A. (2008).Fast convergence to nearly optimal solutions in potential games. In Proceedings of the 9th ACM conference on Electronic commerce, pp. 264–273. https://doi.org/10.1145/1386790.1386832

Cited by 1 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Correction to: Computational Performance of Deep Reinforcement Learning to Find Nash Equilibria;Computational Economics;2023-06-15