EconPapers    
Economics at your fingertips  
 

Best Arm Idenfitication in Generalized Linear Bandits

Lawrence M. Wein and Abbas Kazerouni
Additional contact information
Lawrence M. Wein: Graduate School of Business, Stanford University
Abbas Kazerouni: Department of Electrical Engineering, Stanford University

Research Papers from Stanford University, Graduate School of Business

Abstract: Motivated by drug design, we consider the best-arm identification problem in generalized linear bandits. More specifically, we assume each arm has a vector of covariates, there is an unknown vector of parameters that is common across the arms, and a generalized linear model captures the dependence of rewards on the covariate and parameter vectors. The problem is to minimize the number of arm pulls required to identify an arm that is sufficiently close to optimal with a sufficiently high probability. Building on recent progress in best-arm identification for linear bandits (Xu et al. 2018), we propose the first algorithm for best-arm identification for generalized linear bandits, provide theoretical guarantees on its accuracy and sampling efficiency, and evaluate its performance in various scenarios via simulation.

Date: 2019-05
References: Add references at CitEc
Citations:

Downloads: (external link)
https://www.gsb.stanford.edu/gsb-cmis/gsb-cmis-download-auth/478026
Our link check indicates that this URL is bad, the error code is: 404 Not Found

Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.

Export reference: BibTeX RIS (EndNote, ProCite, RefMan) HTML/Text

Persistent link: https://EconPapers.repec.org/RePEc:ecl:stabus:3784

Access Statistics for this paper

More papers in Research Papers from Stanford University, Graduate School of Business Contact information at EDIRC.
Bibliographic data for series maintained by ().

 
Page updated 2025-04-07
Handle: RePEc:ecl:stabus:3784