Two Person Zero-Sum Sequential Stochastic Games with Imperfect and Incomplete Information—General Case
S. Lakshmivarahan
Additional contact information
S. Lakshmivarahan: University of Oklahoma, School of Electrical Engineering and Computer Science
Chapter Chapter 6 in Learning Algorithms Theory and Applications, 1981, pp 168-196 from Springer
Abstract:
Abstract In this chapter we present a unified approach to two person zero sum games with incomplete and imperfect information wherein the game matrix may not always have a sadlle point in pure strategies. This is a natural extension of the problem of Chapter 5. Under the assumption that both players A and B use the LER−P learning algorithm with the same reward and penalty parameters but the penalty parameter being very small compared to the reward parameter, it is shown that the expected mixed strategy of either player can be made, asymptotically, as close to the optimal strategy dictated by the game theory as desired, irrespective of whether or not the game matrix has a saddle point in pure strategies.
Keywords: Saddle Point; Mixed Strategy; Pure Strategy; Penalty Parameter; Game Matrix (search for similar items in EconPapers)
Date: 1981
References: Add references at CitEc
Citations:
There are no downloads for this item, see the EconPapers FAQ for hints about obtaining it.
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:spr:sprchp:978-1-4612-5975-6_6
Ordering information: This item can be ordered from
http://www.springer.com/9781461259756
DOI: 10.1007/978-1-4612-5975-6_6
Access Statistics for this chapter
More chapters in Springer Books from Springer
Bibliographic data for series maintained by Sonal Shukla () and Springer Nature Abstracting and Indexing ().