Using randomized rounding of linear programs to obtain unweighted natural strata that balance many covariates
Katherine Brumberg,
Dylan S. Small and
Paul R. Rosenbaum
Journal of the Royal Statistical Society Series A, 2022, vol. 185, issue 4, 1931-1951
Abstract:
In causal inference, natural strata are a new compromise between conventional strata and matching in a fixed ratio, say pair matching or matching two controls to each treated individual. Like matching in a fixed ratio, natural strata: (a) do not require weights, (b) balance many measured covariates beyond those that define the strata and (c) provide closer balance for a measured continuous covariate coarsely cut to form strata. Unlike matching in a fixed ratio, the ratio of controls to treated individuals need not be an integer, so if the data permit a fixed ratio comparison of 1‐to‐2.5 or even 1‐to‐0.75, then these ratios are possible using natural strata. Optimal natural strata are defined by a moderate number of fixed strata plus an integer program that minimizes the imbalance in many other measured covariates that are not used to specify the strata. Solving large integer programs is computationally difficult. A tool in the theory of approximation algorithms is ‘randomized rounding of a linear program’ to produce an integer solution: a fractional solution to a linear program defines a probability distribution for an integer‐valued random variable which is sampled. We apply this tool in a new way to produce natural strata and develop new properties of randomized rounding in this context. When proportional strata are impractical, we approximate them by minimizing the earthmover distance to proportionality. The method is applied to study birth outcomes for older and younger mothers in the United States in 2018. An R package natstrat is available at CRAN.
Date: 2022
References: View references in EconPapers View complete reference list from CitEc
Citations:
Downloads: (external link)
https://doi.org/10.1111/rssa.12848
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:bla:jorssa:v:185:y:2022:i:4:p:1931-1951
Ordering information: This journal article can be ordered from
http://ordering.onli ... 1111/(ISSN)1467-985X
Access Statistics for this article
Journal of the Royal Statistical Society Series A is currently edited by A. Chevalier and L. Sharples
More articles in Journal of the Royal Statistical Society Series A from Royal Statistical Society Contact information at EDIRC.
Bibliographic data for series maintained by Wiley Content Delivery ().