Technical Note—The Method of Successive Approximations and Markovian Decision Problems
Arie Hordijk and
Henk Tijms
Additional contact information
Arie Hordijk: Mathematisch Centrum, Amsterdam, The Netherlands
Henk Tijms: Mathematisch Centrum, Amsterdam, The Netherlands
Operations Research, 1974, vol. 22, issue 3, 519-521
Abstract:
This note considers Howard's discrete-time Markovian decision model with the average return as criterion. Using results of Blackwell and MacQueen for the discounted return model it is shown in all generality that the Odoni bounds contain both the maximal average return and the average return of the current policy.
Date: 1974
References: Add references at CitEc
Citations:
Downloads: (external link)
http://dx.doi.org/10.1287/opre.22.3.519 (application/pdf)
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:inm:oropre:v:22:y:1974:i:3:p:519-521
Access Statistics for this article
More articles in Operations Research from INFORMS Contact information at EDIRC.
Bibliographic data for series maintained by Chris Asher ().