Optimal Data Collection for Randomized Control Trials
Pedro Carneiro,
Sokbae (Simon) Lee and
Daniel Wilhelm
No CWP21/19, CeMMAP working papers from Centre for Microdata Methods and Practice, Institute for Fiscal Studies
Abstract:
In a randomized control trial, the precision of an average treatment e?ect estimator and the power of the corresponding t-test can be improved either by collecting data on additional individuals, or by collecting additional covariates that predict the outcome variable. To design the experiment, a researcher needs to solve this tradeo? subject to her budget constraint. We show that this optimization problem is equivalent to optimally predicting outcomes by the covariates, which in turn can be solved using existing machine learning techniques using pre-experimental data such as other similar studies, a census, or a household survey. In two empirical applications, we show that our procedure can lead to reductions of up to 58% in the costs of data collection, or improvements of the same magnitude in the precision of the treatment e?ect estimator.
Date: 2019-05-02
New Economics Papers: this item is included in nep-big and nep-exp
References: View references in EconPapers View complete reference list from CitEc
Citations: View citations in EconPapers (1)
Downloads: (external link)
https://www.ifs.org.uk/uploads/CWP2021_Optimal_Dat ... d_Control_Trials.pdf (application/pdf)
Our link check indicates that this URL is bad, the error code is: 404 Not Found (https://www.ifs.org.uk/uploads/CWP2021_Optimal_Data_Collection_for%20_Randomized_Control_Trials.pdf [302 Found]--> https://ifs.org.uk/uploads/CWP2021_Optimal_Data_Collection_for%20_Randomized_Control_Trials.pdf)
Related works:
Journal Article: Optimal data collection for randomized control trials (2020) 
Working Paper: Optimal data collection for randomized control trials (2017) 
Working Paper: Optimal data collection for randomized control trials (2017) 
Working Paper: Optimal data collection for randomized control trials (2017) 
Working Paper: Optimal data collection for randomized control trials (2017) 
Working Paper: Optimal Data Collection for Randomized Control Trials (2016) 
Working Paper: Optimal data collection for randomized control trials (2016) 
Working Paper: Optimal data collection for randomized control trials (2016) 
Working Paper: Optimal Data Collection for Randomized Control Trials (2016) 
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:ifs:cemmap:21/19
Ordering information: This working paper can be ordered from
The Institute for Fiscal Studies 7 Ridgmount Street LONDON WC1E 7AE
Access Statistics for this paper
More papers in CeMMAP working papers from Centre for Microdata Methods and Practice, Institute for Fiscal Studies The Institute for Fiscal Studies 7 Ridgmount Street LONDON WC1E 7AE. Contact information at EDIRC.
Bibliographic data for series maintained by Emma Hyman ().