EconPapers    
Economics at your fingertips  
 

Earning While Learning: How to Run Batched Bandit Experiments

Jan Kemper () and Davud Rostam-Afschar
Additional contact information
Jan Kemper: ZEW, University of Mannheim

No 18429, IZA Discussion Papers from IZA Network @ LISER

Abstract: Researchers typically collect experimental data sequentially, allowing early outcome observations and adaptive treatment assignment to reduce exposure to inferior treatments. This article reviews multi-armed-bandit adaptive experimental designs that balance exploration and exploitation. Because adaptively collected experimental data through bandit algorithms violate standard asymptotics, inference is challenging. We implement an estimator that yields valid heteroskedasticity-robust confidence intervals in batched bandit designs and compare coverage in Monte Carlo simulations. We introduce bbandits for Stata, a tool for designing experiments via simulation, running interactive bandit experiments, and implementing and analyzing adaptively collected data. bbandits includes three common assignment algorithms—ε-first, ε-greedy, and Thompson sampling—and supports estimation, inference, and visualization.

Keywords: randomized controlled trial; causal inference; multi-armed bandits; experimental design; machine learning (search for similar items in EconPapers)
JEL-codes: C1 C11 C12 C13 C15 C18 C8 C87 C88 C9 D83 (search for similar items in EconPapers)
Date: 2026-03
References: Add references at CitEc
Citations:

Downloads: (external link)
https://docs.iza.org/dp18429.pdf (application/pdf)

Related works:
Working Paper: Earning While Learning: How to Run Batched Bandit Experiments (2026) Downloads
This item may be available elsewhere in EconPapers: Search for items with the same title.

Export reference: BibTeX RIS (EndNote, ProCite, RefMan) HTML/Text

Persistent link: https://EconPapers.repec.org/RePEc:iza:izadps:dp18429

Access Statistics for this paper

More papers in IZA Discussion Papers from IZA Network @ LISER Contact information at EDIRC.
Bibliographic data for series maintained by Mark Fallak ().

 
Page updated 2026-03-10
Handle: RePEc:iza:izadps:dp18429