Exact solution of the Bellman equation for a β -discounted reward in a two-armed bandit with switching arms
Doncho S. Donchev
International Journal of Stochastic Analysis, 1999, vol. 12, 1-10
Abstract:
We consider the symmetric Poissonian two-armed bandit problem. For the case of switching arms, only one of which creates reward, we solve explicitly the Bellman equation for a β -discounted reward and prove that a myopic policy is optimal.
Date: 1999
References: Add references at CitEc
Citations:
Downloads: (external link)
http://downloads.hindawi.com/journals/IJSA/12/924375.pdf (application/pdf)
http://downloads.hindawi.com/journals/IJSA/12/924375.xml (text/xml)
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:hin:jnijsa:924375
DOI: 10.1155/S1048953399000155
Access Statistics for this article
More articles in International Journal of Stochastic Analysis from Hindawi
Bibliographic data for series maintained by Mohamed Abdelhakeem ().