Linear Regressions with Combined Data
Xavier D'Haultfoeuille,
Christophe Gaillac and
Arnaud Maurel
No 25130, RFBerlin Discussion Paper Series from ROCKWOOL Foundation Berlin (RFBerlin)
Abstract:
We study linear regressions in a context where the outcome of interest and some of the covariates are observed in two different datasets that cannot be matched. Traditional approaches obtain point identification by relying, often implicitly, on exclusion restrictions. We show that without such restrictions, coefficients of interest can still be partially identified, with the sharp bounds taking a simple form. We obtain tighter bounds when variables observed in both datasets, but not included in the regression of interest, are available, even if these variables are not subject to specific restrictions. We develop computationally simple and asymptotically normal estimators of the bounds. Finally, we apply our methodology to estimate racial disparities in patent approval rates and to evaluate the effect of patience and risk-taking on educational performance.
Keywords: Data combination; best linear prediction; partial identification. (search for similar items in EconPapers)
JEL-codes: C14 C21 (search for similar items in EconPapers)
Date: 2025-12
References: Add references at CitEc
Citations:
Downloads: (external link)
https://www.rfberlin.com/wp-content/uploads/2025/11/25130.pdf (application/pdf)
Related works:
Working Paper: Linear Regressions with Combined Data (2025) 
Working Paper: Linear Regressions with Combined Data (2025) 
Working Paper: Linear Regressions with Combined Data (2025) 
Working Paper: Linear Regressions with Combined Data (2025) 
Working Paper: Linear Regressions with Combined Data (2024) 
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:crm:wpaper:25130
Access Statistics for this paper
More papers in RFBerlin Discussion Paper Series from ROCKWOOL Foundation Berlin (RFBerlin) Contact information at EDIRC.
Bibliographic data for series maintained by Moritz Lubczyk () and Matthew Nibloe ().