Markov Decision Processes with Imprecise Transition Probabilities
Chelsea C. White and
Hany K. Eldeib
Additional contact information
Chelsea C. White: University of Michigan, Ann Arbor, Michigan
Hany K. Eldeib: INTELSAT Corporation, Washington, D.C.
Operations Research, 1994, vol. 42, issue 4, 739-749
Abstract:
We present new numerical algorithms and bounds for the infinite horizon, discrete stage, finite state and action Markov decision process with imprecise transition probabilities. We assume that the transition probability mass vector for each state and action is described by a finite number of linear inequalities. This model of imprecision appears to be well suited for describing statistically determined confidence limits and/or natural language statements of likelihood. The numerical procedures for calculating an optimal max-min strategy are based on successive approximations, reward revision, and modified policy iteration. The bounds that are determined are at least as tight as currently available bounds for the case where the transition probabilities are precise.
Keywords: dynamic; programming/optimal; control:; Markov; decision; processes (search for similar items in EconPapers)
Date: 1994
References: Add references at CitEc
Citations: View citations in EconPapers (18)
Downloads: (external link)
http://dx.doi.org/10.1287/opre.42.4.739 (application/pdf)
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:inm:oropre:v:42:y:1994:i:4:p:739-749
Access Statistics for this article
More articles in Operations Research from INFORMS Contact information at EDIRC.
Bibliographic data for series maintained by Chris Asher ().