Discounted semi‐Markov decision processes: linear programming and policy iteration
J. Wessels and
J. A. E. E. van Nunen
Statistica Neerlandica, 1975, vol. 29, issue 1, 1-7
Abstract:
For semi‐Markov decision processes with discounted rewards we derive the well known results regarding the structure of optimal strategies (nonrandomized, stationary Markov strategies) and the standard algorithms (linear programming, policy iteration). Our analysis is completely based on a primal linear programming formulation of the problem.
Date: 1975
References: Add references at CitEc
Citations:
Downloads: (external link)
https://doi.org/10.1111/j.1467-9574.1975.tb00238.x
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:bla:stanee:v:29:y:1975:i:1:p:1-7
Ordering information: This journal article can be ordered from
http://www.blackwell ... bs.asp?ref=0039-0402
Access Statistics for this article
Statistica Neerlandica is currently edited by Miroslav Ristic, Marijtje van Duijn and Nan van Geloven
More articles in Statistica Neerlandica from Netherlands Society for Statistics and Operations Research
Bibliographic data for series maintained by Wiley Content Delivery ().