Q-learning facilitates norm emergence in metanorm game model with topological structures

Zhang, Wei; Zhao, Dongkai; Jin, Xing; Zhang, Huizhen; An, Tianbo; Cui, Guanghai; Wang, Zhen

Chaos, Solitons & Fractals, 2025, vol. 195, issue C

Abstract: Axelrod’s model and the studies that build on it provide a valuable framework for fostering cooperation norms among self-interested agents. Within this framework, “boldness” (B) and “vengefulness” (V) characterize agents’ behavior in terms of cooperation and punishment (including metapunishment). However, describing behavior solely through the parameters B and V may be overly simplistic and may not generalize well to other scenarios. Moreover, privacy concerns and the difficulty of evaluating complex states in real-world settings limit agents’ access to detailed payoff information from their neighbors. To address these issues, our paper employs self-regarding Q-learning, a well-established method for examining strategy-update dynamics and agents’ learning processes, to investigate whether metanorms can emerge naturally through players’ strategy selection. In extensive experiments, cooperative norms emerge successfully, driven by changes in agents’ strategy selection: on average, over 90% of agents choose to cooperate. We then explore the underlying reasons for this emergence by analyzing changes in Q-values and the frequencies of punishment and metapunishment. Additionally, we examine the impact of topological structures on players’ strategy selection and assess norm emergence across different temptation levels, population sizes, and regulatory intensity levels to evaluate the model’s sensitivity.
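
For readers unfamiliar with the learning rule named in the abstract, the following is a minimal sketch of a standard tabular Q-learning update with epsilon-greedy action selection, in which an agent learns only from its own reward. It is not the authors' model: the state label, the action names, and the values of alpha, gamma, and epsilon are illustrative assumptions, and the metanorm game payoffs are not reproduced here.

```python
import random

def q_update(q, state, action, reward, next_state, alpha=0.1, gamma=0.9):
    """Standard tabular Q-learning update; returns the new Q-value.

    alpha (learning rate) and gamma (discount factor) are assumed values.
    """
    best_next = max(q[next_state].values())
    q[state][action] += alpha * (reward + gamma * best_next - q[state][action])
    return q[state][action]

def choose_action(q, state, epsilon=0.1, rng=random):
    """Epsilon-greedy choice that uses only the agent's own Q-values."""
    if rng.random() < epsilon:
        return rng.choice(list(q[state]))       # explore
    return max(q[state], key=q[state].get)      # exploit

# Hypothetical single-state table with two illustrative actions.
q_table = {"s": {"cooperate": 0.0, "defect": 0.0}}
new_value = q_update(q_table, "s", "cooperate", reward=1.0, next_state="s")
greedy_action = choose_action(q_table, "s", epsilon=0.0)
```

With epsilon set to zero the agent always picks the action with the highest current Q-value; a positive epsilon keeps some exploration in the strategy selection.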

Keywords: Social dilemma; Punishment mechanism; Metanorm game; Self-regarding Q-learning; Metapunishment; Norm emergence
Date: 2025

Downloads: (external link)
http://www.sciencedirect.com/science/article/pii/S0960077925003108
Full text for ScienceDirect subscribers only

Persistent link: https://EconPapers.repec.org/RePEc:eee:chsofr:v:195:y:2025:i:c:s0960077925003108

DOI: 10.1016/j.chaos.2025.116297

Chaos, Solitons & Fractals is currently edited by Stefano Boccaletti and Stelios Bekiros

Chaos, Solitons & Fractals is published by Elsevier
Bibliographic data for series maintained by Thayer, Thomas R.