Strategic Experimentation with Poisson Bandits
Sven Rady and
R Keller
No 7270, CEPR Discussion Papers from C.E.P.R. Discussion Papers
Abstract:
We study a game of strategic experimentation with two-armed bandits where the risky arm distributes lump-sum payoffs according to a Poisson process. Its intensity is either high or low, and unknown to the players. We consider Markov perfect equilibria with beliefs as the state variable. As the belief process is piecewise deterministic, payoff functions solve differential-difference equations. There is no equilibrium where all players use cut-off strategies, and all equilibria exhibit an 'encouragement effect' relative to the single-agent optimum. We construct asymmetric equilibria in which players have symmetric continuation values at sufficiently optimistic beliefs yet take turns playing the risky arm before all experimentation stops. Owing to the encouragement effect, these equilibria Pareto dominate the unique symmetric one for sufficiently frequent turns. Rewarding the last experimenter with a higher continuation value increases the range of beliefs where players experiment, but may reduce average payoffs at more optimistic beliefs. Some equilibria exhibit an 'anticipation effect': as beliefs become more pessimistic, the continuation value of a single experimenter increases over some range because a lower belief means a shorter wait until another player takes over.
Keywords: Bayesian learning; Differential-difference equation; Markov perfect equilibrium; Piecewise deterministic process; Poisson process; Strategic experimentation; Two-armed bandit (search for similar items in EconPapers)
JEL-codes: C73 D83 O32 (search for similar items in EconPapers)
Date: 2009-04
References: View references in EconPapers View complete reference list from CitEc
Citations:
Downloads: (external link)
https://cepr.org/publications/DP7270 (application/pdf)
CEPR Discussion Papers are free to download for our researchers, subscribers and members. If you fall into one of these categories but have trouble downloading our papers, please contact us at subscribers@cepr.org
Related works:
Working Paper: Strategic Experimentation with Poisson Bandits (2009) 
Working Paper: Strategic Experimentation with Poisson Bandits (2007)
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:cpr:ceprdp:7270
Ordering information: This working paper can be ordered from
https://cepr.org/publications/DP7270
Access Statistics for this paper
More papers in CEPR Discussion Papers from C.E.P.R. Discussion Papers Centre for Economic Policy Research, 33 Great Sutton Street, London EC1V 0DX.
Bibliographic data for series maintained by ().