EconPapers    
Economics at your fingertips  
 

Learning to play games in extensive form by valuation

Philippe Jehiel () and Dov Samet

Game Theory and Information from EconWPA

Abstract: A valuation for a board game is an assignment of numeric values to different states of the board. The valuation reflects the desirability of the states for the player. It can be used by a player to decide on her next move during the play. We assume a myopic player, who chooses a move with the highest valuation. Valuations can also be revised, and hopefully improved, after each play of the game. Here, a very simple valuation revision is considered, in which the states of the board visited in a play are assigned the payoff obtained in the play. We show that by adopting such a learning process a player who has a winning strategy in a win-lose game can almost surely guarantee a win in a repeated game. When a player has more than two payoffs, a more elaborate learning procedure is required. We consider one that associates with each state the average payoff in the rounds in which this node was reached. When all players adopt this learning procedure, with some perturbations, then, with probability 1, strategies that are close to subgame perfect equilibrium are played after some time. A single player who adopts this procedure can guarantee only her individually rational payoff.

Keywords: reinforcement; learning (search for similar items in EconPapers)
JEL-codes: C7 D8 (search for similar items in EconPapers)
Date: 2001-01-01
Note: Type of Document - ; pages: 18
View list of references View citations in EconPapers

Downloads: (external link)
http://129.3.20.41/eps/game/papers/0012/0012001.pdf (application/pdf)

Related works:
Working Paper: Learning To Play Games In Extensive Form By Valuation (2001) Downloads
Journal Article: Learning to play games in extensive form by valuation (2005) Downloads
This item may be available elsewhere in EconPapers: Search for items with the same title.

Export reference: BibTeX RIS (EndNote, ProCite, RefMan) HTML/Text

Persistent link: http://EconPapers.repec.org/RePEc:wpa:wuwpga:0012001

Access Statistics for this paper

More papers in Game Theory and Information from EconWPA
Series data maintained by EconWPA ().

 
Page updated 2009-11-26
Handle: RePEc:wpa:wuwpga:0012001