Using Randomized Rounding of Linear Programs to Obtain Unweighted Natural Strata that Balance Many Covariates

Author:

Brumberg Katherine1,Small Dylan S.1,Rosenbaum Paul R.1

Affiliation:

1. Wharton School, University of Pennsylvania , Philadelphia , Pennsylvania , USA

Abstract

Abstract In causal inference, natural strata are a new compromise between conventional strata and matching in a fixed ratio, say pair matching or matching two controls to each treated individual. Like matching in a fixed ratio, natural strata: (a) do not require weights, (b) balance many measured covariates beyond those that define the strata and (c) provide closer balance for a measured continuous covariate coarsely cut to form strata. Unlike matching in a fixed ratio, the ratio of controls to treated individuals need not be an integer, so if the data permit a fixed ratio comparison of 1-to-2.5 or even 1-to-0.75, then these ratios are possible using natural strata. Optimal natural strata are defined by a moderate number of fixed strata plus an integer program that minimizes the imbalance in many other measured covariates that are not used to specify the strata. Solving large integer programs is computationally difficult. A tool in the theory of approximation algorithms is ‘randomized rounding of a linear program’ to produce an integer solution: a fractional solution to a linear program defines a probability distribution for an integer-valued random variable which is sampled. We apply this tool in a new way to produce natural strata and develop new properties of randomized rounding in this context. When proportional strata are impractical, we approximate them by minimizing the earthmover distance to proportionality. The method is applied to study birth outcomes for older and younger mothers in the United States in 2018. An R package natstrat is available at CRAN.

Publisher

Oxford University Press (OUP)

Subject

Statistics, Probability and Uncertainty,Economics and Econometrics,Social Sciences (miscellaneous),Statistics and Probability

Reference26 articles.

1. Building representative matched samples with multi-valued treatments in large observational studies;Bennett;Journal of Computational and Graphical Statistics,2020

2. The effectiveness of adjustment by subclassification in removing bias in observational studies;Cochran;Biometrics,1968

3. Optimal full matching and related designs via network flows;Hansen;Journal of Computational and Graphical Statistics,2006

4. Combining propensity score matching and group-based trajectory analysis in an observational study;Haviland;Psychological Methods,2007

5. Standardization: a technique to control for extraneous variables;Kalton;Journal of the Royal Statistical Society: Series C,1968

Cited by 3 articles. 订阅此论文施引文献 订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3