Optimal and Nearly Optimal Policies in Markov Decision Chains with Nonnegative Rewards and Risk-Sensitive Expected Total-Reward Criterion

Cavazos-Cadena, Rolando; de-Oca, Raúl Montes-

Optimal and Nearly Optimal Policies in Markov Decision Chains with Nonnegative Rewards and Risk-Sensitive Expected Total-Reward Criterion

Rolando Cavazos-Cadena () and Raúl Montes- de-Oca ()
Additional contact information
Rolando Cavazos-Cadena: Universidad Autónoma Agraria Antonio Narro, Departamento de Estadística y Cálculo
Raúl Montes- de-Oca: Universidad Autónoma Metropolitana, Departamento de Matemáticas

Chapter Chapter 11 in Markov Processes and Controlled Markov Chains, 2002, pp 189-221 from Springer

Abstract: Abstract This work considers Markov decision processes with discrete state space. Assuming that the decision maker has a non-null constant risk-sensitivity, which leads to grade random rewards via the expectation of an exponential utility function, the performance index of a control policy is the risk-sensitive expected total-reward criterion corresponding to a nonnegative reward function. Within this framework, the existence of optimal and approximately optimal stationary policies in the absolute sense is studied. The main results can be summarised as follows: (i) An optimal stationary policy exists if the state and actions sets are finite, whereas an ε-optimal stationary policy is guaranteed when just the state space is finite. (ii) This latter fact is used to obtain, for the general denumerable state space case, that ε-optimal stationary policies exist if the controller is risk-seeking and the optimal value function is bounded. In contrast with the usual approach, the analysis performed in the paper does not involve the discounted criterion, and is completely based on properties of optimal value function, particularly, on the the strong optimality equation.

Keywords: Utility function; constant risk-sensitivity; Ornstein’s theorem; strong optimality equation; risk-seeking controller (search for similar items in EconPapers)
Date: 2002
References: Add references at CitEc
Citations:

There are no downloads for this item, see the EconPapers FAQ for hints about obtaining it.

Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.

Export reference: BibTeX RIS (EndNote, ProCite, RefMan) HTML/Text

Persistent link: https://EconPapers.repec.org/RePEc:spr:sprchp:978-1-4613-0265-0_11

Ordering information: This item can be ordered from
http://www.springer.com/9781461302650

DOI: 10.1007/978-1-4613-0265-0_11

Access Statistics for this chapter

More chapters in Springer Books from Springer
Bibliographic data for series maintained by Sonal Shukla () and Springer Nature Abstracting and Indexing ().