The reinforcement learning model with heterogeneous learning rate in activity-driven networks
-
Published:2023-01-05
Issue:
Volume:
Page:
-
ISSN:0129-1831
-
Container-title:International Journal of Modern Physics C
-
language:en
-
Short-container-title:Int. J. Mod. Phys. C
Affiliation:
1. School of Mathematical Sciences, Jiangsu University, Zhenjiang, Jiangsu 212013, P. R. China
Abstract
Agent’s learning behavior usually presents biased judgments influenced by many internal and external reasons, we incorporate an improved [Formula: see text]-learning algorithm in the reinforcement learning which is examined with the prisoner’s dilemma game in an activity-driven networks. The heterogeneous learning rate and [Formula: see text]-greedy exploration mechanism are taken into account while modeling decision-making of agents. Simulation results show the proposed reinforcement learning mechanism is conducive to the emergence of defective behavior, i.e. it could maximize one’s expected payoff regardless of its neighbors’ strategy. In addition, we find the temptation gain, vision level and the number of connected edges of activated agents are proportional to the density of defectors. Interestingly, when the inherent learning rate is small, the increase of exploration rate can demote the appearance of defectors, and the decrease of defectors is insignificant by increasing of exploration rate conversely.
Funder
National Natural Science Foundation of China
Natural Science Foundation of Shanghai
Publisher
World Scientific Pub Co Pte Ltd
Subject
Computational Theory and Mathematics,Computer Science Applications,General Physics and Astronomy,Mathematical Physics,Statistical and Nonlinear Physics
Cited by
1 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献