Optimal Data Collection for Randomized Control Trials
Pedro Carneiro,
Sokbae (Simon) Lee and
Daniel Wilhelm
No 9908, IZA Discussion Papers from Institute of Labor Economics (IZA)
Abstract:
In a randomized control trial, the precision of an average treatment effect estimator can be improved either by collecting data on additional individuals, or by collecting additional covariates that predict the outcome variable. We propose the use of pre-experimental data such as a census, or a household survey, to inform the choice of both the sample size and the covariates to be collected. Our procedure seeks to minimize the resulting average treatment effect estimator's mean squared error, subject to the researcher's budget constraint. We rely on a modification of an orthogonal greedy algorithm that is conceptually simple and easy to implement in the presence of a large number of potential covariates, and does not require any tuning parameters. In two empirical applications, we show that our procedure can lead to substantial gains of up to 58%, measured either in terms of reductions in data collection costs or in terms of improvements in the precision of the treatment effect estimator.
Keywords: randomized control trials; big data; data collection; optimal survey design; orthogonal greedy algorithm; survey costs (search for similar items in EconPapers)
JEL-codes: C55 C81 (search for similar items in EconPapers)
Pages: 56 pages
Date: 2016-04
New Economics Papers: this item is included in nep-dev, nep-ecm, nep-exp and nep-pr~
References: View references in EconPapers View complete reference list from CitEc
Citations: View citations in EconPapers (2)
Published - published in: Econometrics Journal, 2020, 23 (1), 1 - 31
Downloads: (external link)
https://docs.iza.org/dp9908.pdf (application/pdf)
Related works:
Journal Article: Optimal data collection for randomized control trials (2020) 
Working Paper: Optimal Data Collection for Randomized Control Trials (2019) 
Working Paper: Optimal data collection for randomized control trials (2017) 
Working Paper: Optimal data collection for randomized control trials (2017) 
Working Paper: Optimal data collection for randomized control trials (2017) 
Working Paper: Optimal data collection for randomized control trials (2017) 
Working Paper: Optimal Data Collection for Randomized Control Trials (2016) 
Working Paper: Optimal data collection for randomized control trials (2016) 
Working Paper: Optimal data collection for randomized control trials (2016) 
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:iza:izadps:dp9908
Ordering information: This working paper can be ordered from
IZA, Margard Ody, P.O. Box 7240, D-53072 Bonn, Germany
Access Statistics for this paper
More papers in IZA Discussion Papers from Institute of Labor Economics (IZA) IZA, P.O. Box 7240, D-53072 Bonn, Germany. Contact information at EDIRC.
Bibliographic data for series maintained by Holger Hinte ().