Risk-Averse Markov Decision Processes Through a Distributional Lens

Cheng, Ziteng; Jaimungal, Sebastian

Ziteng Cheng and Sebastian Jaimungal
Additional contact information
Ziteng Cheng: Department of Statistical Sciences, University of Toronto, Toronto, Ontario M5G 1Z5, Canada
Sebastian Jaimungal: Department of Statistical Sciences, University of Toronto, Toronto, Ontario M5G 1Z5, Canada

Mathematics of Operations Research, 2025, vol. 50, issue 3, 1707-1733

Abstract: By adopting a distributional viewpoint on law-invariant convex risk measures, we construct dynamic risk measures (DRMs) at the distributional level. We then apply these DRMs to investigate Markov decision processes, incorporating latent costs, random actions, and weakly continuous transition kernels. Furthermore, the proposed DRMs allow risk aversion to change dynamically. Under mild assumptions, we derive a dynamic programming principle and show the existence of an optimal policy in both finite and infinite time horizons. Moreover, we provide a sufficient condition for the optimality of deterministic actions. For illustration, we conclude the paper with examples from optimal liquidation with limit order books and autonomous driving.

Keywords: 90C39; 91G70; dynamic programming; Markov decision processes; risk measures; risk averse
Date: 2025

Downloads:
http://dx.doi.org/10.1287/moor.2023.0211 (application/pdf)

Persistent link: https://EconPapers.repec.org/RePEc:inm:ormoor:v:50:y:2025:i:3:p:1707-1733
