A semimartingale characterization of average optimal stationary policies for Markov decision processes
Quanxin Zhu and
Xianping Guo
International Journal of Stochastic Analysis, 2006, vol. 2006, 1-8
Abstract:
This paper deals with discrete-time Markov decision processes with Borel state and action spaces. The criterion to be minimized is the average expected costs, and the costs may have neither upper nor lower bounds. In our former paper (to appear in Journal of Applied Probability), weaker conditions are proposed to ensure the existence of average optimal stationary policies. In this paper, we further study some properties of optimal policies. Under these weaker conditions, we not only obtain two necessary and sufficient conditions for optimal policies, but also give a semimartingale characterization of an average optimal stationary policy.
Date: 2006
References: Add references at CitEc
Citations:
Downloads: (external link)
http://downloads.hindawi.com/journals/IJSA/2006/081593.pdf (application/pdf)
http://downloads.hindawi.com/journals/IJSA/2006/081593.xml (text/xml)
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:hin:jnijsa:081593
DOI: 10.1155/JAMSA/2006/81593
Access Statistics for this article
More articles in International Journal of Stochastic Analysis from Hindawi
Bibliographic data for series maintained by Mohamed Abdelhakeem ().