Estimation of strategies in a markov game
Jerzy A. Filar
Naval Research Logistics Quarterly, 1976, vol. 23, issue 3, 469-480
Abstract:
In this paper a two‐person Markov game, in discrete time, and with perfect state information, is considered from the point of view of a single player (player A) only. It is assumed that A's opponent (player B) uses the same strategy every time the game is played. It is shown that A can obtain a consistent estimate of B's strategy on the basis of his past experience of playing the game with B. Two methods of deriving such an estimate are given. Further, it is shown that using one of these estimates A can construct a strategy for himself which is asymptotically optimal. A simple example of a game in which the above method may be useful is given.
Date: 1976
References: Add references at CitEc
Citations:
Downloads: (external link)
https://doi.org/10.1002/nav.3800230309
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:wly:navlog:v:23:y:1976:i:3:p:469-480
Access Statistics for this article
More articles in Naval Research Logistics Quarterly from John Wiley & Sons
Bibliographic data for series maintained by Wiley Content Delivery ().