EconPapers    
Economics at your fingertips  
 

Minimax rates and adaptivity in combining experimental and observational data

Chen Shuxiao, Li Sai, Zhang Bo and Ye Ting ()
Additional contact information
Chen Shuxiao: Department of Statistics and Data Science, The Wharton School, University of Pennsylvania, Philadelphia, Pennsylvania, United States
Li Sai: Institute of Statistics and Big Data, Renmin University of China, Haidian, Beijing, P. R. China
Zhang Bo: Vaccine and Infectious Disease Division, Fred Hutchinson Cancer Center, Seattle, Washington, United States
Ye Ting: Department of Biostatistics, University of Washington, Seattle, Washington, United States

Journal of Causal Inference, 2025, vol. 13, issue 1, 40

Abstract: Randomized controlled trials (RCTs) are the gold standard for evaluating the causal effect of a treatment; however, they often have limited sample sizes and sometimes poor generalizability. On the other hand, non-randomized, observational data derived from large administrative databases have massive sample sizes and better generalizability, but are prone to unmeasured confounding bias. It is thus of considerable interest to reconcile effect estimates obtained from RCTs and observational studies investigating the same intervention, potentially harvesting the best from both realms. In this article, we theoretically characterize the potential efficiency gain from integrating observational data into the RCT-based analysis from a minimax perspective. For estimation, we derive the minimax rate of convergence for the mean-squared error and propose adaptive estimators that attain the optimal rate up to poly-log factors. For inference, we characterize the minimax rate for the length of confidence intervals and show that adaptation (to unknown confounding bias) is in general impossible. A curious phenomenon thus emerges: for estimation, the efficiency gain from data integration can be achieved without prior knowledge of the magnitude of the confounding bias; for inference, the same task becomes information theoretically impossible in general. We corroborate our theoretical findings using simulations and a real data example from the RCT DUPLICATE initiative.

Keywords: causal inference; generalizability; integrative data analysis; observational studies; randomized controlled trials; transfer learning (search for similar items in EconPapers)
Date: 2025
References: Add references at CitEc
Citations:

Downloads: (external link)
https://doi.org/10.1515/jci-2024-0024 (text/html)

Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.

Export reference: BibTeX RIS (EndNote, ProCite, RefMan) HTML/Text

Persistent link: https://EconPapers.repec.org/RePEc:bpj:causin:v:13:y:2025:i:1:p:40:n:1001

DOI: 10.1515/jci-2024-0024

Access Statistics for this article

Journal of Causal Inference is currently edited by Elias Bareinboim, Jin Tian and Iván Díaz

More articles in Journal of Causal Inference from De Gruyter
Bibliographic data for series maintained by Peter Golla ().

 
Page updated 2025-07-22
Handle: RePEc:bpj:causin:v:13:y:2025:i:1:p:40:n:1001