EconPapers    
Economics at your fingertips  
 

Finding the Best Dueler

Zhengu Zhang and Sheldon M. Ross ()
Additional contact information
Zhengu Zhang: Department of Industrial and Systems Engineering, University of Southern California, Los Angeles, CA 90089, USA
Sheldon M. Ross: Department of Industrial and Systems Engineering, University of Southern California, Los Angeles, CA 90089, USA

Mathematics, 2023, vol. 11, issue 7, 1-12

Abstract: Consider a set of n players. We suppose that each game involves two players, that there is some unknown player who wins each game it plays with a probability greater than 1 / 2 , and that our objective is to determine this best player. Under the requirement that the policy employed guarantees a correct choice with a probability of at least some specified value, we look for a policy that has a relatively small expected number of games played before decision. We consider this problem both under the assumption that the best player wins each game with a probability of at least some specified value p 0 > 1 / 2 , and under a Bayesian assumption that the probability that player i wins a game against player j is v i v i + v j , where v 1 , … , v n are the unknown values of n independent and identically distributed exponential random variables. In the former case, we propose a policy where chosen pairs play a match that ends when one of them has had a specified number of wins more than the other; in the latter case, we propose a Thompson sampling type rule.

Keywords: best arm identification; dueling bandit (search for similar items in EconPapers)
JEL-codes: C (search for similar items in EconPapers)
Date: 2023
References: View complete reference list from CitEc
Citations:

Downloads: (external link)
https://www.mdpi.com/2227-7390/11/7/1568/pdf (application/pdf)
https://www.mdpi.com/2227-7390/11/7/1568/ (text/html)

Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.

Export reference: BibTeX RIS (EndNote, ProCite, RefMan) HTML/Text

Persistent link: https://EconPapers.repec.org/RePEc:gam:jmathe:v:11:y:2023:i:7:p:1568-:d:1105376

Access Statistics for this article

Mathematics is currently edited by Ms. Emma He

More articles in Mathematics from MDPI
Bibliographic data for series maintained by MDPI Indexing Manager ().

 
Page updated 2025-03-19
Handle: RePEc:gam:jmathe:v:11:y:2023:i:7:p:1568-:d:1105376