EconPapers    
Economics at your fingertips  
 

Neighborhood-based cross fitting approach to treatment effects with high-dimensional data

Oluwagbenga David Agboola and Han Yu

Computational Statistics & Data Analysis, 2023, vol. 186, issue C

Abstract: High-dimensional data are increasingly popular in various physical, biological and social disciplines. A common existing approach of repeatedly splitting data was suggested to address the overfitting problem in high-dimensional statistics, however it is computationally expensive in high dimensions. A computationally efficient data splitting method is proposed and referred to as Neighborhood-Based Cross Fitting (NBCF) double machine learning in causal inference for structural causal models with high-dimensional data. The proposed method deals well with the problem of post-selection bias in causal inference in the presence of high-dimensional confounding. It provides an equivalent basis in unbiased estimation as repeated data splitting, which is suggested to expand the complexity scope of function class by empirical process methods. Numerical simulation studies were conducted to demonstrate that the proposed neighborhood-based approach is not only more computationally efficient than the existing sample splitting methods, but also better in bias reduction compared with other existing methods. Under certain conditions, simulation results further showed that the proposed estimators are asymptotically unbiased and normally distributed, which allows construction of valid confidence intervals. The practical application of NBCF is illustrated with a real dataset.

Keywords: Structural causal model; High-dimensional data; Confounder; Data splitting; Support points; Machine learning (search for similar items in EconPapers)
Date: 2023
References: View references in EconPapers View complete reference list from CitEc
Citations:

Downloads: (external link)
http://www.sciencedirect.com/science/article/pii/S0167947323000919
Full text for ScienceDirect subscribers only.

Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.

Export reference: BibTeX RIS (EndNote, ProCite, RefMan) HTML/Text

Persistent link: https://EconPapers.repec.org/RePEc:eee:csdana:v:186:y:2023:i:c:s0167947323000919

DOI: 10.1016/j.csda.2023.107780

Access Statistics for this article

Computational Statistics & Data Analysis is currently edited by S.P. Azen

More articles in Computational Statistics & Data Analysis from Elsevier
Bibliographic data for series maintained by Catherine Liu ().

 
Page updated 2025-03-19
Handle: RePEc:eee:csdana:v:186:y:2023:i:c:s0167947323000919