Computational Methods for Risk-Averse Undiscounted Transient Markov Models
Özlem Çavuş () and
Andrzej Ruszczynski ()
Additional contact information
Özlem Çavuş: Department of Industrial Engineering, Bilkent University, Ankara 06800, Turkey
Operations Research, 2014, vol. 62, issue 2, 401-417
Abstract:
The total cost problem for discrete-time controlled transient Markov models is considered. The objective functional is a Markov dynamic risk measure of the total cost. Two solution methods, value and policy iteration, are proposed, and their convergence is analyzed. In the policy iteration method, we propose two algorithms for policy evaluation: the nonsmooth Newton method and convex programming, and we prove their convergence. The results are illustrated on a credit limit control problem.
Keywords: dynamic programming; risk measures; transient Markov models; value iteration; policy iteration (search for similar items in EconPapers)
Date: 2014
References: View references in EconPapers View complete reference list from CitEc
Citations: View citations in EconPapers (1)
Downloads: (external link)
http://dx.doi.org/10.1287/opre.2013.1251 (application/pdf)
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:inm:oropre:v:62:y:2014:i:2:p:401-417
Access Statistics for this article
More articles in Operations Research from INFORMS Contact information at EDIRC.
Bibliographic data for series maintained by Chris Asher ().