Some Monotonicity Results for Partially Observed Markov Decision Processes
William S. Lovejoy
Additional contact information
William S. Lovejoy: Stanford University, Stanford, California
Operations Research, 1987, vol. 35, issue 5, 736-743
Abstract:
This paper provides sufficient conditions for the optimal value in a discrete-time, finite, partially observed Markov decision process to be monotone on the space of state probability vectors ordered by likelihood ratios. The paper also presents sufficient conditions for the optimal policy to be monotone in a simple machine replacement problem, and, in the general case, for the optimal policy to be bounded from below by an easily calculated monotone function.
Keywords: 113; 118 partially observed Markov decision processes; 565 monotonicity results for partially observed Markov decision processes (search for similar items in EconPapers)
Date: 1987
References: Add references at CitEc
Citations: View citations in EconPapers (25)
Downloads: (external link)
http://dx.doi.org/10.1287/opre.35.5.736 (application/pdf)
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:inm:oropre:v:35:y:1987:i:5:p:736-743
Access Statistics for this article
More articles in Operations Research from INFORMS Contact information at EDIRC.
Bibliographic data for series maintained by Chris Asher ().