Author:
Pan Jiateng,Yoshikawa Atsushi,Yamamura Masayuki
Abstract
It is widely accepted that rational individuals are unable to create cooperation in a prisoner’s dilemma. However, in everyday life, cooperation, for example, during a fishing moratorium, can be observed frequently. Additionally, the appearance of cooperation in the prisoner’s dilemma can be seen in numerous simulation studies.
This paper reviews 31 simulation studies published between January 2017 and January 2023 in which agents can be observed in the results to improve cooperation in a prisoner’s dilemma.
The proposed methodologies were sorted into seven categories, including Bounded Rationality, Memory, Adaptive Strategy, Mood Model, Intrinsic Reward, Network Dynamics, and Altruistic Attribute. Based on their impacts, the effectiveness of these seven approaches was classified into three categories: generating cooperation, maintaining cooperation, and spreading cooperation.
This review is expected to be helpful for scholars conducting future research on multi-agent cooperation and irrational agent modeling.
Reference48 articles.
1. Partner Selection for the Emergence of Cooperation in Multi-Agent Systems Using Reinforcement Learning
2. Effective Choice in the Prisoner's Dilemma
3. Campbell R., & Sowden L. (Eds.). (1985). Paradoxes of rationality and cooperation: prisoner’s dilemma and Newcomb’s problem. UBC Press.
4. Chong S. Y., Humble J., Kendall G., Li J., & Yao X. (2007). Iterated prisoner’s dilemma and evolutionary game theory. In The Iterated Prisoners’ Dilemma: 20 Years On (pp. 23-62).
5. Collenette J., Atkinson K., Bloembergen D., & Tuyls K. (2017, September). Mood modelling within reinforcement learning. In ECAL 2017, the Fourteenth European Conference on Artificial Life (pp. 106-113). MIT Press.