On the Conjecture of Berry Regarding a Bernoulli Two-Armed Bandit
Jichen Zhang and
Panyu Wu ()
Additional contact information
Jichen Zhang: School of Mathematics, Shandong University, Jinan 250100, China
Panyu Wu: Zhongtai Securities Institute for Financial Studies, Shandong University, Jinan 250100, China
Mathematics, 2023, vol. 11, issue 3, 1-19
Abstract:
In this paper, we study an independent Bernoulli two-armed bandit with unknown parameters ρ and λ , where ρ and λ have a pair of priori distributions such that d R ( ρ ) = C R ρ r 0 ( 1 − ρ ) r 0 ′ d μ ( ρ ) , d L ( λ ) = C L λ l 0 ( 1 − λ ) l 0 ′ d μ ( λ ) and μ is an arbitrary positive measure on [ 0 , 1 ] . Berry proposed the conjecture that, given a pair of priori distributions ( R , L ) of parameters ρ and λ , the arm with R is the current optimal choice if r 0 + r 0 ′ < l 0 + l 0 ′ and the expectation of ρ is not less than that of λ . We give an easily verifiable equivalent form of Berry’s conjecture and use it to prove that Berry’s conjecture holds when R and L are two-point distributions as well as when R and L are beta distributions and the number of trials N ≤ r 0 r 0 ′ + 1 .
Keywords: Bernoulli two-armed bandit; stochastically maximizing; prior distributions; Bayesian decision theory (search for similar items in EconPapers)
JEL-codes: C (search for similar items in EconPapers)
Date: 2023
References: View references in EconPapers View complete reference list from CitEc
Citations:
Downloads: (external link)
https://www.mdpi.com/2227-7390/11/3/733/pdf (application/pdf)
https://www.mdpi.com/2227-7390/11/3/733/ (text/html)
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:gam:jmathe:v:11:y:2023:i:3:p:733-:d:1053697
Access Statistics for this article
Mathematics is currently edited by Ms. Emma He
More articles in Mathematics from MDPI
Bibliographic data for series maintained by MDPI Indexing Manager ().