Constrained Markov Decision Processes with Non-constant Discount Factor
Héctor Jasso-Fuentes and
Tomás Prieto-Rumeau
Additional contact information
Héctor Jasso-Fuentes: CINVESTAV-IPN
Tomás Prieto-Rumeau: UNED
Journal of Optimization Theory and Applications, 2024, vol. 202, issue 2, No 15, 897-931
Abstract:
This paper studies constrained Markov decision processes under the total expected discounted cost optimality criterion, with a state-action dependent discount factor that may take any value between zero and one. Both the state space and the action space are assumed to be Borel spaces. Using the linear programming approach, which consists of stating the control problem as a linear problem on a set of occupation measures, we show the existence of an optimal stationary Markov policy. Our results are based on the study of both weak-strong topologies in the space of occupation measures and Young measures in the space of Markov policies.
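The occupation-measure idea in the abstract can be illustrated on a toy finite model. The sketch below is not the paper's construction: it assumes finite state and action sets, discount factors strictly below one, and entirely hypothetical numbers and names (P, ALPHA, COST, NU, POLICY). For a fixed randomized stationary policy it computes the discounted occupation measure mu and checks that the total expected discounted cost equals the linear functional sum of COST(x,a) * mu(x,a), which is what lets the control problem be written as a linear program. A constraint cost would be handled the same way, as a second linear functional of mu.

```python
# Hypothetical 2-state, 2-action model; every number here is an illustrative assumption.
STATES = [0, 1]
ACTIONS = [0, 1]

# P[x][a][y]: probability of moving from state x to state y under action a.
P = [
    [[0.9, 0.1], [0.2, 0.8]],
    [[0.5, 0.5], [0.1, 0.9]],
]
# ALPHA[x][a]: state-action dependent discount factor (kept strictly below 1 here).
ALPHA = [[0.9, 0.5], [0.7, 0.8]]
# COST[x][a]: one-stage cost.
COST = [[1.0, 2.0], [0.5, 3.0]]
# NU[x]: initial state distribution.
NU = [0.6, 0.4]
# POLICY[x][a]: randomized stationary policy, probability of action a in state x.
POLICY = [[0.3, 0.7], [1.0, 0.0]]


def occupation_measure(policy, n_iter=2000):
    """Return mu[x][a] = E sum_t (prod_{k<t} alpha(x_k, a_k)) 1{x_t = x, a_t = a}."""
    # The state marginal m solves the balance equation
    #   m(y) = nu(y) + sum_{x,a} alpha(x,a) P(y|x,a) policy(a|x) m(x),
    # found here by fixed-point iteration (a contraction because alpha < 1).
    m = NU[:]
    for _ in range(n_iter):
        m = [
            NU[y] + sum(ALPHA[x][a] * P[x][a][y] * policy[x][a] * m[x]
                        for x in STATES for a in ACTIONS)
            for y in STATES
        ]
    return [[m[x] * policy[x][a] for a in ACTIONS] for x in STATES]


def discounted_cost(policy, cost, n_iter=2000):
    """Total expected discounted cost of a fixed policy, by iterating its value function."""
    v = [0.0 for _ in STATES]
    for _ in range(n_iter):
        v = [
            sum(policy[x][a] * (cost[x][a]
                                + ALPHA[x][a] * sum(P[x][a][y] * v[y] for y in STATES))
                for a in ACTIONS)
            for x in STATES
        ]
    return sum(NU[x] * v[x] for x in STATES)


mu = occupation_measure(POLICY)
linear_cost = sum(COST[x][a] * mu[x][a] for x in STATES for a in ACTIONS)
direct_cost = discounted_cost(POLICY, COST)

# A stationary policy is recovered from mu by normalizing over actions.
recovered = [[mu[x][a] / sum(mu[x]) for a in ACTIONS] for x in STATES]
```

In this toy setting, minimizing the linear cost over all nonnegative mu that satisfy the balance equation, subject to a linear bound on a constraint cost, is a finite linear program, and normalizing an optimal mu over actions yields a stationary policy. The Borel-space case with discount factors allowed to reach one is what the paper addresses and is not captured by this sketch.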
Keywords: Markov decision processes; Constrained control problems; Occupation measures; Linear programming; 93E20; 90C40; 60J05
Date: 2024
Downloads: (external link)
http://link.springer.com/10.1007/s10957-024-02453-y Abstract (text/html)
Access to the full text of the articles in this series is restricted.
Persistent link: https://EconPapers.repec.org/RePEc:spr:joptap:v:202:y:2024:i:2:d:10.1007_s10957-024-02453-y
DOI: 10.1007/s10957-024-02453-y
Journal of Optimization Theory and Applications is currently edited by Franco Giannessi and David G. Hull