Exact solution of the Bellman equation for a β-discounted reward in a two-armed bandit with switching arms

Doncho S. Donchev

International Journal of Stochastic Analysis, 1999, vol. 12, 1-10

Abstract:

We consider the symmetric Poissonian two-armed bandit problem. For the case of switching arms, only one of which creates reward, we solve explicitly the Bellman equation for a β-discounted reward and prove that a myopic policy is optimal.

Date: 1999

Downloads:
http://downloads.hindawi.com/journals/IJSA/12/924375.pdf (application/pdf)
http://downloads.hindawi.com/journals/IJSA/12/924375.xml (text/xml)

Persistent link: https://EconPapers.repec.org/RePEc:hin:jnijsa:924375

DOI: 10.1155/S1048953399000155
