Constrained Risk-Sensitive Markov Decision Chains
Karel Sladký
Chapter 59 in Operations Research Proceedings 2008, 2009, pp 363-368 from Springer
Abstract:
Summary We consider a Markov decision chain $${X = \{ X_n ,n = 0,1 \cdots \}}$$ with finite state space I = {1; 2;…;N} and finite set A i = {1; 2;…;K i} of possible decisions (actions) in state i Є I. posing that in state i Є I action a ЄA i is selected, then state j is reached in the next transition with a given probability p ij (a) and one-stage transition rewards r ij (a) and s ij (a) will be accrued to such transition. We shall suppose that the streams of transition rewards are evaluated by an exponential utility function, say $${u^\gamma ( \cdot )}$$ , with risk aversion coefficient γ 0 (the risk seeking case). Then the utility assigned to the (random) reward ξ is given by $${u^{\rm \gamma } (\xi ): = {\rm sign (\gamma ) exp (\gamma \xi )}}$$ .
Date: 2009
References: Add references at CitEc
Citations:
There are no downloads for this item, see the EconPapers FAQ for hints about obtaining it.
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:spr:sprchp:978-3-642-00142-0_59
Ordering information: This item can be ordered from
http://www.springer.com/9783642001420
DOI: 10.1007/978-3-642-00142-0_59
Access Statistics for this chapter
More chapters in Springer Books from Springer
Bibliographic data for series maintained by Sonal Shukla () and Springer Nature Abstracting and Indexing ().