The double-edged sword effect of conformity on cooperation in spatial Prisoner’s Dilemma Games with reinforcement learning
Pai Wang and
Zhihu Yang
Chaos, Solitons & Fractals, 2024, vol. 187, issue C
Abstract:
Imitation based on fitness comparison has long been a representative strategy updating method in evolutionary game theory, with the pursuit of profit maximization at its core. However, the method fails when obtaining other agents’ income information is inaccessible or prohibitively expensive. As an alternative, reinforcement learning has been frequently used to alleviate this problem, yet it rarely achieves socially optimal outcomes. To fill this gap, this study proposes a self-regarding Q-learning with conformity effect and investigates its impact on the evolution of cooperation in a spatial Prisoner’s Dilemma Game. The instant reward of Q-learning is re-scaled following the logic that the more popular the strategy, the higher the re-scaled reward. The results reveal that reinforcement learning can alleviate social dilemmas in a way that prevents both sides of a game from freezing in a dilemma of mutual defection, thereby facilitating the coexistence of cooperation and defection. Depending on the dilemma strength, intriguingly, conformity has a two-sided effect on the evolution of cooperation. For lower b-values it promotes cooperation whereas for higher b-values it hinders cooperation. The reasons behind these phenomena are analyzed and the simulation results are shown to be consistent with the theoretical analysis.
Keywords: Prisoner’s Dilemma Game; Evolution of cooperation; Reinforcement learning; Conformity (search for similar items in EconPapers)
Date: 2024
References: View references in EconPapers View complete reference list from CitEc
Citations:
Downloads: (external link)
http://www.sciencedirect.com/science/article/pii/S096007792401035X
Full text for ScienceDirect subscribers only
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:eee:chsofr:v:187:y:2024:i:c:s096007792401035x
DOI: 10.1016/j.chaos.2024.115483
Access Statistics for this article
Chaos, Solitons & Fractals is currently edited by Stefano Boccaletti and Stelios Bekiros
More articles in Chaos, Solitons & Fractals from Elsevier
Bibliographic data for series maintained by Thayer, Thomas R. ().