Optimal data collection for randomized control trials
Pedro Carneiro,
Sokbae (Simon) Lee and
Daniel Wilhelm
No CWP15/16, CeMMAP working papers from Centre for Microdata Methods and Practice, Institute for Fiscal Studies
Abstract:
In a randomized control trial, the precision of an average treatment e ffect estimator can be improved either by collecting data on additional individuals, or by collecting additional covariates that predict the outcome variable. We propose the use of pre-experimental data such as a census, or a household survey, to inform the choice of both the sample size and the covariates to be collected. Our procedure seeks to minimize the resulting average treatment e ect estimator's mean squared error, subject to the researcher's budget constraint. We rely on an orthogonal greedy algorithm that is conceptually simple, easy to implement (even when the number of potential covariates is very large), and does not require any tuning parameters. In two empirical applications, we show that our procedure can lead to substantial gains of up to 58%, either in terms of reductions in data collection costs or in terms of improvements in the precision of the treatment eff ect estimator, respectively. The original version of the working paper, posted on 01 April, 2016, is available here.
Keywords: randomized control trials; big data; data collection; optimal surveydesign; orthogonal greedy algorithm; survey costs. (search for similar items in EconPapers)
JEL-codes: C55 C81 (search for similar items in EconPapers)
Date: 2016-04-01
New Economics Papers: this item is included in nep-dev, nep-exp, nep-net and nep-ore
References: View references in EconPapers View complete reference list from CitEc
Citations: View citations in EconPapers (2)
Downloads: (external link)
http://www.ifs.org.uk/uploads/cemmap/wps/cwp151616%20%282nd%20version%29.pdf (application/pdf)
Our link check indicates that this URL is bad, the error code is: 404 Not Found (http://www.ifs.org.uk/uploads/cemmap/wps/cwp151616%20%282nd%20version%29.pdf [301 Moved Permanently]--> https://www.ifs.org.uk/uploads/cemmap/wps/cwp151616%20(2nd%20version).pdf [302 Found]--> https://ifs.org.uk/uploads/cemmap/wps/cwp151616%20(2nd%20version).pdf)
Related works:
Journal Article: Optimal data collection for randomized control trials (2020) 
Working Paper: Optimal Data Collection for Randomized Control Trials (2019) 
Working Paper: Optimal data collection for randomized control trials (2017) 
Working Paper: Optimal data collection for randomized control trials (2017) 
Working Paper: Optimal data collection for randomized control trials (2017) 
Working Paper: Optimal data collection for randomized control trials (2017) 
Working Paper: Optimal Data Collection for Randomized Control Trials (2016) 
Working Paper: Optimal data collection for randomized control trials (2016) 
Working Paper: Optimal Data Collection for Randomized Control Trials (2016) 
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:ifs:cemmap:15/16
Ordering information: This working paper can be ordered from
The Institute for Fiscal Studies 7 Ridgmount Street LONDON WC1E 7AE
Access Statistics for this paper
More papers in CeMMAP working papers from Centre for Microdata Methods and Practice, Institute for Fiscal Studies The Institute for Fiscal Studies 7 Ridgmount Street LONDON WC1E 7AE. Contact information at EDIRC.
Bibliographic data for series maintained by Emma Hyman ().