A Robust Approach to Quantifying Uncertainty in Matching Problems of Causal Inference

Morucci, Marco; Noor-E-Alam, Md.; Rudin, Cynthia

Marco Morucci, Md. Noor-E-Alam and Cynthia Rudin
Additional contact information
Marco Morucci: Center for Data Science, New York University, New York 10012
Md. Noor-E-Alam: Department of Mechanical and Industrial Engineering, Northeastern University, Boston, Massachusetts 02115
Cynthia Rudin: Department of Computer Science, Duke University, Durham, North Carolina 27708

INFORMS Journal on Data Science, 2022, vol. 1, issue 2, 156-171

Abstract: Unquantified sources of uncertainty in observational causal analyses can break the integrity of the results. One would never want another analyst to repeat a calculation with the same data set, using a seemingly identical procedure, only to find a different conclusion. However, as we show in this work, there is a typical source of uncertainty that is essentially never considered in observational causal studies: the choice of match assignment for matched groups—that is, which unit is matched to which other unit before a hypothesis test is conducted. The choice of match assignment is anything but innocuous and can have a surprisingly large influence on the causal conclusions. Given that a vast number of causal inference studies test hypotheses on treatment effects after treatment cases are matched with similar control cases, we should find a way to quantify how much this extra source of uncertainty impacts results. What we would really like to be able to report is that no matter which match assignment is made, as long as the match is sufficiently good, then the hypothesis test results are still informative. In this paper, we provide methodology based on discrete optimization to create robust tests that explicitly account for this possibility. We formulate robust tests for binary and continuous data based on common test statistics as integer linear programs solvable with common methodologies. We study the finite-sample behavior of our test statistic in the discrete-data case. We apply our methods to simulated and real-world data sets and show that they can produce useful results in practical applied settings.
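
Illustration (not from the paper): the abstract's central point, that different acceptable match assignments can yield different test results, can be seen on a toy example. The sketch below enumerates every one-to-one assignment of treated units to control units whose covariates differ by at most a caliper, then reports the smallest and largest value of a McNemar-style statistic across those assignments. The data, the caliper rule for a "sufficiently good" match, the function names, and the brute-force enumeration are all assumptions made for illustration. The paper formulates the search as integer linear programs, which this sketch does not attempt to reproduce.

```python
from itertools import permutations
from math import sqrt


def mcnemar_z(pairs):
    """McNemar-style z statistic for matched binary outcomes (treated, control)."""
    b = sum(1 for yt, yc in pairs if yt == 1 and yc == 0)  # treated 1, control 0
    c = sum(1 for yt, yc in pairs if yt == 0 and yc == 1)  # treated 0, control 1
    return 0.0 if b + c == 0 else (b - c) / sqrt(b + c)


def statistic_range(treated, controls, caliper):
    """Min and max statistic over all feasible one-to-one match assignments.

    treated, controls: lists of (covariate, binary_outcome) tuples.
    A match is feasible (hypothetical rule) if covariates differ by <= caliper.
    Brute force: only practical for very small inputs.
    """
    values = []
    for chosen in permutations(range(len(controls)), len(treated)):
        feasible = all(
            abs(treated[i][0] - controls[j][0]) <= caliper
            for i, j in enumerate(chosen)
        )
        if feasible:
            pairs = [(treated[i][1], controls[j][1]) for i, j in enumerate(chosen)]
            values.append(mcnemar_z(pairs))
    if not values:
        raise ValueError("no feasible match assignment under this caliper")
    return min(values), max(values)


# Made-up toy data: (covariate, outcome)
treated = [(1.0, 1), (2.0, 1), (3.0, 0)]
controls = [(1.1, 0), (1.9, 1), (2.1, 0), (3.2, 1)]

lo, hi = statistic_range(treated, controls, caliper=0.5)
print(f"statistic ranges from {lo:.3f} to {hi:.3f} across feasible assignments")
```

In this toy data only the second treated unit has two admissible controls, yet that single choice moves the statistic between 0 and about 0.577. If the test's conclusion is the same at both ends of such a range, it does not depend on which admissible assignment the analyst happened to pick.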

Keywords: matching; hypothesis testing; robust optimization; causal inference
Date: 2022

Downloads:
http://dx.doi.org/10.1287/ijds.2022.0020 (application/pdf)

Persistent link: https://EconPapers.repec.org/RePEc:inm:orijds:v:1:y:2022:i:2:p:156-171