Debiased Machine Learning U-statistics
Juan Carlos Escanciano and
Jo\"el Robert Terschuur
Papers from arXiv.org
Abstract:
We propose a method to debias estimators based on U-statistics with Machine Learning (ML) first-steps. Standard plug-in estimators often suffer from regularization and model-selection biases, producing invalid inferences. We show that Debiased Machine Learning (DML) estimators can be constructed within a U-statistics framework to correct these biases while preserving desirable statistical properties. The approach delivers simple, robust estimators with provable asymptotic normality and good finite-sample performance. We apply our method to three problems: inference on Inequality of Opportunity (IOp) using the Gini coefficient of ML-predicted incomes given circumstances, inference on predictive accuracy via the Area Under the Curve (AUC), and inference on linear models with ML-based sample-selection corrections. Using European survey data, we present the first debiased estimates of income IOp. In our empirical application, commonly employed ML-based plug-in estimators systematically underestimate IOp, while our debiased estimators are robust across ML methods.
Date: 2022-06, Revised 2025-10
New Economics Papers: this item is included in nep-big, nep-cmp, nep-dem and nep-ecm
References: View references in EconPapers View complete reference list from CitEc
Citations: View citations in EconPapers (4)
Downloads: (external link)
http://arxiv.org/pdf/2206.05235 Latest version (application/pdf)
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:arx:papers:2206.05235
Access Statistics for this paper
More papers in Papers from arXiv.org
Bibliographic data for series maintained by arXiv administrators ().