Optimal data collection for randomized control trials
Pedro Carneiro,
Sokbae (Simon) Lee and
Daniel Wilhelm
No 45/17, CeMMAP working papers from Institute for Fiscal Studies
Abstract:
In a randomized control trial, the precision of an average treatment effect estimator and the power of the corresponding t-test can be improved either by collecting data on additional individuals, or by collecting additional covariates that predict the outcome variable. We propose the use of pre-experimental data such as other similar studies, a census, or a household survey, to inform the choice of both the sample size and the covariates to be collected. Our procedure seeks to minimize the resulting average treatment effect estimator's mean squared error and/or maximize the corresponding t-test's power, subject to the researcher's budget constraint. We rely on a modication of an orthogonal greedy algorithm that is conceptually simple and easy to implement in the presence of a large number of potential covariates, and does not require any tuning parameters. In two empirical applications, we show that our procedure can lead to reductions of up to 58% in the costs of data collection, or improvements of the same magnitude in the precision of the treatment effect estimator.
Date: 2017-10-23
References: View references in EconPapers View complete reference list from CitEc
Citations:
Downloads: (external link)
https://www.cemmap.ac.uk/wp-content/uploads/2020/08/CWP4517.pdf (application/pdf)
Related works:
Journal Article: Optimal data collection for randomized control trials (2020) 
Working Paper: Optimal Data Collection for Randomized Control Trials (2019) 
Working Paper: Optimal data collection for randomized control trials (2017) 
Working Paper: Optimal data collection for randomized control trials (2017) 
Working Paper: Optimal data collection for randomized control trials (2017) 
Working Paper: Optimal Data Collection for Randomized Control Trials (2016) 
Working Paper: Optimal data collection for randomized control trials (2016) 
Working Paper: Optimal data collection for randomized control trials (2016) 
Working Paper: Optimal Data Collection for Randomized Control Trials (2016) 
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:azt:cemmap:45/17
DOI: 10.1920/wp.cem.2017.4517
Access Statistics for this paper
More papers in CeMMAP working papers from Institute for Fiscal Studies Contact information at EDIRC.
Bibliographic data for series maintained by Dermot Watson ().