A reinforcement learning process in extensive form games

Laslier, Jean-François; Walliser, Bernard

A reinforcement learning process in extensive form games

Jean-François Laslier and Bernard Walliser

Abstract: The CPR ("cumulative proportional reinforcement") learning rule stipulates that an agent chooses a move with a probability proportional to the cumulative payoff she obtained in the past with that move. Previously considered for strategies in normal form games (Laslier, Topol and Walliser, Games and Econ. Behav., 2001), the CPR rule is here adapted for actions in perfect information extensive form games. The paper shows that the action-based CPR process converges with probability one to the (unique) subgame perfect equilibrium.

Keywords: Learning; Polya process; Reinforcement; Subgame Perfect Equilibrium (search for similar items in EconPapers)
Date: 2005-06
References: Add references at CitEc
Citations: View citations in EconPapers (7)

Published in International Journal of Game Theory, 2005, 33 (2), pp.219-227. ⟨10.1007/s001820400194⟩

There are no downloads for this item, see the EconPapers FAQ for hints about obtaining it.

Related works:
Journal Article: A reinforcement learning process in extensive form games (2005)
This item may be available elsewhere in EconPapers: Search for items with the same title.

Export reference: BibTeX RIS (EndNote, ProCite, RefMan) HTML/Text

Persistent link: https://EconPapers.repec.org/RePEc:hal:journl:halshs-00754083

DOI: 10.1007/s001820400194

Access Statistics for this paper

More papers in Post-Print from HAL
Bibliographic data for series maintained by CCSD ().