An expected average reward criterion
K.-J. Bierth
Stochastic Processes and their Applications, 1987, vol. 26, 123-140
Abstract:
In the present paper the expected average reward criterion is considered instead of the average expected reward criterion commonly used in stochastic dynamic programming. This new criterion seems to be more natural and yields stronger results. In addition to the theory of Markov chains, the theory of martingales will be used. This paper is concerned with Markov decision models with finite state space, arbitrary action space and bounded reward functions. In such a model there is always available a Markov policy which almost maximizes the average reward over a unit of time for different criteria. If the action space is a compact metric space there is even a stationary policy with the same property; further if a stationary policy is optimal for one criterion then this policy is optimal for all average reward criteria. Thus the paper solves some problems posed by Demko and Hill (1984).
Keywords: Markov; decision; models; dynamic; programming; average; reward; criteria; ([epsilon]-)optimal; policies; Markov; policies; stationary; policies (search for similar items in EconPapers)
Date: 1987
References: Add references at CitEc
Citations: View citations in EconPapers (1)
Downloads: (external link)
http://www.sciencedirect.com/science/article/pii/0304-4149(87)90055-X
Full text for ScienceDirect subscribers only
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:eee:spapps:v:26:y:1987:i::p:123-140
Ordering information: This journal article can be ordered from
http://http://www.elsevier.com/wps/find/supportfaq.cws_home/regional
https://shop.elsevie ... _01_ooc_1&version=01
Access Statistics for this article
Stochastic Processes and their Applications is currently edited by T. Mikosch
More articles in Stochastic Processes and their Applications from Elsevier
Bibliographic data for series maintained by Catherine Liu ().