Debiased Machine Learning U-statistics
Juan Carlos Escanciano and
Jo\"el Robert Terschuur
Papers from arXiv.org
Abstract:
We propose a method to debias estimators based on U-statistics with machine-learning (ML) first steps. Standard plug-in estimators often suffer from regularization and model-selection biases, leading to invalid inference. We characterize orthogonal adjustment terms for two-step U-statistics, construct Debiased Machine Learning (DML) U-estimators, develop a cross-fitting algorithm, and establish a general asymptotic theory for inference with ML first steps. We illustrate the methodology with applications to Inequality of Opportunity (IOp), the Area Under the ROC Curve, and conditional moment restrictions. For the latter setting, we introduce Kernel-DML, a class of orthogonal estimators based on identification-preserving kernel distance criteria, and apply it to semiparametric production function estimation. Using European survey data, we provide debiased ML-based estimates of income IOp and show that the widely used Conditional Inference Forest plug-in approach can substantially underestimate IOp. Moreover, plug-in results vary substantially across ML methods, whereas the debiased estimates are considerably more stable.
Date: 2022-06, Revised 2026-06
New Economics Papers: this item is included in nep-big, nep-cmp, nep-dem and nep-ecm
References: View references in EconPapers View complete reference list from CitEc
Citations: View citations in EconPapers (4)
Downloads: (external link)
http://arxiv.org/pdf/2206.05235 Latest version (application/pdf)
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:arx:papers:2206.05235
Access Statistics for this paper
More papers in Papers from arXiv.org
Bibliographic data for series maintained by arXiv administrators ().