Approximation of average cost optimal policies for general Markov decision processes with unbounded costs

Gordienko, Evgueni; De-Oca, Raúl Montes-; Minjárez-Sosa, Adolfo

Approximation of average cost optimal policies for general Markov decision processes with unbounded costs

Evgueni Gordienko, Raúl Montes- De-Oca and Adolfo Minjárez-Sosa

Mathematical Methods of Operations Research, 1997, vol. 45, issue 2, 245-263

Abstract: The aim of the paper is to show that Lyapunov-like ergodicity conditions on Markov decision processes with Borel state space and possibly unbounded cost provide the approximation of an average cost optimal policy by solvingn-stage optimization problems (n=1, 2, ...). The used approach ensures the exponential rate of convergence. The approximation of this type would be useful to find adaptive procedures of control and to estimate stability of an optimal control under disturbances of the transition probability. Copyright Physica-Verlag 1997

Keywords: Markov Decision Process; Average Cost Criterion; Value Iteration; Approximation of Optimal Policy; Geometrical Convergence (search for similar items in EconPapers)
Date: 1997
References: View complete reference list from CitEc
Citations:

Downloads: (external link)
http://hdl.handle.net/10.1007/BF01193864 (text/html)
Access to full text is restricted to subscribers.

Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.

Export reference: BibTeX RIS (EndNote, ProCite, RefMan) HTML/Text

Persistent link: https://EconPapers.repec.org/RePEc:spr:mathme:v:45:y:1997:i:2:p:245-263

Ordering information: This journal article can be ordered from
http://www.springer.com/economics/journal/00186

DOI: 10.1007/BF01193864

Access Statistics for this article

Mathematical Methods of Operations Research is currently edited by Oliver Stein

More articles in Mathematical Methods of Operations Research from Springer, Gesellschaft für Operations Research (GOR), Nederlands Genootschap voor Besliskunde (NGB)
Bibliographic data for series maintained by Sonal Shukla () and Springer Nature Abstracting and Indexing ().