Abstract:
We propose a new and simple adaptive procedure for playing a game: "regret-matching." In this procedure, players depart from their current play with probabilities that are proportional to measures of regret for not having used other strategies in the past. It is shown that our adaptive procedure guarantees that, with probability one, the empirical distributions of play converge to the set of correlated equilibria of the game. To compute these regret measures, a player needs to know his payoff function and the history of play. We also offer a variation where every player knows only his own realized payoff history (but not his payoff function).
JEL-codes:C72D83 (search for similar items in EconPapers) Date: 1997-03-24, Revised 1997-11-25 Note: January 1997. Revised: October 1997. Paper + 3 figures (postscript). Also available at URL below View list of referencesView citations in EconPapers