EconPapers    
Economics at your fingertips  
 

Regret minimization in repeated matrix games with variable stage duration

Shie Mannor and Nahum Shimkin

Games and Economic Behavior, 2008, vol. 63, issue 1, pages 227-258

Abstract: Regret minimization in repeated matrix games has been extensively studied ever since Hannan's seminal paper [Hannan, J., 1957. Approximation to Bayes risk in repeated play. In: Dresher, M., Tucker, A.W., Wolfe, P. (Eds.), Contributions to the Theory of Games, vol. III. Ann. of Math. Stud., vol. 39, Princeton Univ. Press, Princeton, NJ, pp. 97-193]. Several classes of no-regret strategies now exist; such strategies secure a long-term average payoff as high as could be obtained by the fixed action that is best, in hindsight, against the observed action sequence of the opponent. We consider an extension of this framework to repeated games with variable stage duration, where the duration of each stage may depend on actions of both players, and the performance measure of interest is the average payoff per unit time. We start by showing that no-regret strategies, in the above sense, do not exist in general. Consequently, we consider two classes of adaptive strategies, one based on Blackwell's approachability theorem and the other on calibrated play, and examine their performance guarantees. We further provide sufficient conditions for existence of no-regret strategies in this model.

Downloads: (external link)
http://www.sciencedi ... 0856c5758e8314acb646
Full text for ScienceDirect subscribers only

Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.

Access Statistics for this article

Games and Economic Behavior is edited by E. Kalai

More articles in Games and Economic Behavior from Elsevier
Series data maintained by Heidi Boesdal ().

 
Page updated 2008-07-12
Handle: RePEc:eee:gamebe:v:63:y:2008:i:1:p:227-258