Nonuniqueness versus Uniqueness of Optimal Policies in Convex Discounted Markov Decision Processes

de-Oca, Raúl Montes-; Lemus-Rodríguez, Enrique; Salem-Silva, Francisco Sergio

Nonuniqueness versus Uniqueness of Optimal Policies in Convex Discounted Markov Decision Processes

Raúl Montes- de-Oca, Enrique Lemus-Rodríguez and Francisco Sergio Salem-Silva

Journal of Applied Mathematics, 2013, vol. 2013, issue 1

Abstract: From the classical point of view, it is important to determine if in a Markov decision process (MDP), besides their existence, the uniqueness of the optimal policies is guaranteed. It is well known that uniqueness does not always hold in optimization problems (for instance, in linear programming). On the other hand, in such problems it is possible for a slight perturbation of the functional cost to restore the uniqueness. In this paper, it is proved that the value functions of an MDP and its cost perturbed version stay close, under adequate conditions, which in some sense is a priority. We are interested in the stability of Markov decision processes with respect to the perturbations of the cost‐as‐you‐go function.

Date: 2013
References: Add references at CitEc
Citations:

Downloads: (external link)
https://doi.org/10.1155/2013/271279

Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.

Export reference: BibTeX RIS (EndNote, ProCite, RefMan) HTML/Text

Persistent link: https://EconPapers.repec.org/RePEc:wly:jnljam:v:2013:y:2013:i:1:n:271279

Access Statistics for this article

More articles in Journal of Applied Mathematics from John Wiley & Sons
Bibliographic data for series maintained by Wiley Content Delivery ().