Algorithmic cooperation: A comparison with human play in the infinitely repeated prisoner's dilemma
Bernhard Kasberger,
Simon Martin,
Hans-Theo Normann and
Tobias Werner
No 437, DICE Discussion Papers from Heinrich Heine University Düsseldorf, Düsseldorf Institute for Competition Economics (DICE)
Abstract:
Reinforcement learning algorithms play an increasingly important role in economic situations. These situations are often strategic, and the artificial intelligence may or may not be cooperative. We compare human and algorithmic cooperation rates in the infinitely repeated two-player prisoner's dilemma and study which strategies they choose to cooperate and punish deviations. Through a sequence of computational Q-learning and human-player experiments, we find that our Q-learning algorithms tend to cooperate less than humans, particularly when cooperation is risky or not incentive-compatible. Algorithms often use different strategies than humans, leading to distinct on- and off-path behavior.
Keywords: Artificial intelligence; cooperation; Q-learning; repeated prisoner's dilemma (search for similar items in EconPapers)
JEL-codes: C72 C73 C92 D83 (search for similar items in EconPapers)
Date: 2026
References: Add references at CitEc
Citations:
Downloads: (external link)
https://www.econstor.eu/bitstream/10419/341427/1/1972802348.pdf (application/pdf)
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:zbw:dicedp:341427
Access Statistics for this paper
More papers in DICE Discussion Papers from Heinrich Heine University Düsseldorf, Düsseldorf Institute for Competition Economics (DICE) Contact information at EDIRC.
Bibliographic data for series maintained by ZBW - Leibniz Information Centre for Economics ().