On the Convergence of Reinforcement Learning
Alan Beggs
No 96, Economics Series Working Papers from University of Oxford, Department of Economics
Abstract:
This paper examines the convergence of payoffs and strategies in Erev and Roth`s model of reinforcement learning. When all players use this rule it eliminates iteratively dominated strategies and in two-person constant-sum games average payoffs converge to the value of the game. Strategies converge in constant-sum games with unique equilibria if they are pure or in 2
Keywords: reinforcement learning; games (search for similar items in EconPapers)
JEL-codes: C72 D83 (search for similar items in EconPapers)
Date: 2002-03-01
References: Add references at CitEc
Citations: View citations in EconPapers (2)
Downloads: (external link)
https://ora.ox.ac.uk/objects/uuid:3bf646be-fa6a-478a-8355-4aff36bcad06 (text/html)
Related works:
Journal Article: On the convergence of reinforcement learning (2005) 
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:oxf:wpaper:96
Access Statistics for this paper
More papers in Economics Series Working Papers from University of Oxford, Department of Economics Contact information at EDIRC.
Bibliographic data for series maintained by Anne Pouliquen ( this e-mail address is bad, please contact ).