Reinforcement learning produces dominant strategies for the Iterated Prisoner’s Dilemma
Marc Harper,
Vincent Knight,
Martin Jones,
Georgios Koutsovoulos,
Nikoleta E Glynatsi and
Owen Campbell
PLOS ONE, 2017, vol. 12, issue 12, 1-33
Abstract:
We present tournament results and several powerful strategies for the Iterated Prisoner’s Dilemma created using reinforcement learning techniques (evolutionary and particle swarm algorithms). These strategies are trained to perform well against a corpus of over 170 distinct opponents, including many well-known and classic strategies. All the trained strategies win standard tournaments against the total collection of other opponents. The trained strategies and one particular human made designed strategy are the top performers in noisy tournaments also.
Date: 2017
References: View references in EconPapers View complete reference list from CitEc
Citations: View citations in EconPapers (5)
Downloads: (external link)
https://journals.plos.org/plosone/article?id=10.1371/journal.pone.0188046 (text/html)
https://journals.plos.org/plosone/article/file?id= ... 88046&type=printable (application/pdf)
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:plo:pone00:0188046
DOI: 10.1371/journal.pone.0188046
Access Statistics for this article
More articles in PLOS ONE from Public Library of Science
Bibliographic data for series maintained by plosone ().