A reinforcement learning process in extensive form games
Jean-François Laslier and
Bernard Walliser
Post-Print from HAL
Abstract:
The CPR ("cumulative proportional reinforcement") learning rule stipulates that an agent chooses a move with a probability proportional to the cumulative payoff she obtained in the past with that move. Previously considered for strategies in normal form games (Laslier, Topol and Walliser, Games and Econ. Behav., 2001), the CPR rule is here adapted for actions in perfect information extensive form games. The paper shows that the action-based CPR process converges with probability one to the (unique) subgame perfect equilibrium.
Keywords: Learning; Polya process; Reinforcement; Subgame Perfect Equilibrium (search for similar items in EconPapers)
Date: 2005-06
References: Add references at CitEc
Citations: View citations in EconPapers (7)
Published in International Journal of Game Theory, 2005, 33 (2), pp.219-227. ⟨10.1007/s001820400194⟩
There are no downloads for this item, see the EconPapers FAQ for hints about obtaining it.
Related works:
Journal Article: A reinforcement learning process in extensive form games (2005) 
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:hal:journl:halshs-00754083
DOI: 10.1007/s001820400194
Access Statistics for this paper
More papers in Post-Print from HAL
Bibliographic data for series maintained by CCSD ().