Double/debiased machine learning for treatment and structural parameters

Chernozhukov, Victor; Chetverikov, Denis; Demirer, Mert; Duflo, Esther; Hansen, Christian; Newey, Whitney; Robins, James

Double/debiased machine learning for treatment and structural parameters

Victor Chernozhukov, Denis Chetverikov (), Mert Demirer, Esther Duflo, Christian Hansen, Whitney Newey and James Robins
Additional contact information
Denis Chetverikov: Institute for Fiscal Studies and UCLA
Mert Demirer: Institute for Fiscal Studies
Esther Duflo: Institute for Fiscal Studies
James Robins: Institute for Fiscal Studies

No CWP28/17, CeMMAP working papers from Centre for Microdata Methods and Practice, Institute for Fiscal Studies

Abstract: We revisit the classic semiparametric problem of inference on a low dimensional parameter ?0 in the presence of high-dimensional nuisance parameters ?0. We depart from the classical setting by allowing for ?0 to be so high-dimensional that the traditional assumptions, such as Donsker properties, that limit complexity of the parameter space for this object break down. To estimate ?0, we consider the use of statistical or machine learning (ML) methods which are particularly well-suited to estimation in modern, very high-dimensional cases. ML methods perform well by employing regularization to reduce variance and trading off regularization bias with overfitting in practice. However, both regularization bias and overfitting in estimating ?0 cause a heavy bias in estimators of ?0 that are obtained by naively plugging ML estimators of ?0 into estimating equations for ?0. This bias results in the naive estimator failing to be N -1/2 consistent, where N is the sample size. We show that the impact of regularization bias and overfitting on estimation of the parameter of interest ?0 can be removed by using two simple, yet critical, ingredients: (1) using Neyman-orthogonal moments/scores that have reduced sensitivity with respect to nuisance parameters to estimate ?0, and (2) making use of cross-fitting which provides an efficient form of data-splitting. We call the resulting set of methods double or debiased ML (DML). We verify that DML delivers point estimators that concentrate in a N -1/2-neighborhood of the true parameter values and are approximately unbiased and normally distributed, which allows construction of valid confidence statements. The generic statistical theory of DML is elementary and simultaneously relies on only weak theoretical requirements which will admit the use of a broad array of modern ML methods for estimating the nuisance parameters such as random forests, lasso, ridge, deep neural nets, boosted trees, and various hybrids and ensembles of these methods. We illustrate the general theory by applying it to provide theoretical properties of DML applied to learn the main regression parameter in a partially linear regression model, DML applied to learn the coefficient on an endogenous variable in a partially linear instrumental variables model, DML applied to learn the average treatment effect and the average treatment effect on the treated under unconfoundedness, and DML applied to learn the local average treatment effect in an instrumental variables setting. In addition to these theoretical applications, we also illustrate the use of DML in three empirical examples.

Date: 2017-06-02
New Economics Papers: this item is included in nep-big and nep-cmp
References: View references in EconPapers View complete reference list from CitEc
Citations: View citations in EconPapers (132)

Downloads: (external link)
https://www.ifs.org.uk/uploads/cemmap/wps/CWP281717.pdf (application/pdf)
Our link check indicates that this URL is bad, the error code is: 404 Not Found (https://www.ifs.org.uk/uploads/cemmap/wps/CWP281717.pdf [302 Found]--> https://ifs.org.uk/uploads/cemmap/wps/CWP281717.pdf)

Related works:
Journal Article: Double/debiased machine learning for treatment and structural parameters (2018)
Working Paper: Double/debiased machine learning for treatment and structural parameters (2017)
Working Paper: Double/Debiased Machine Learning for Treatment and Structural Parameters (2017)
This item may be available elsewhere in EconPapers: Search for items with the same title.

Export reference: BibTeX RIS (EndNote, ProCite, RefMan) HTML/Text

Persistent link: https://EconPapers.repec.org/RePEc:ifs:cemmap:28/17

Ordering information: This working paper can be ordered from
The Institute for Fiscal Studies 7 Ridgmount Street LONDON WC1E 7AE

Access Statistics for this paper

More papers in CeMMAP working papers from Centre for Microdata Methods and Practice, Institute for Fiscal Studies The Institute for Fiscal Studies 7 Ridgmount Street LONDON WC1E 7AE. Contact information at EDIRC.
Bibliographic data for series maintained by Emma Hyman ().