Optimal refinement of strata to balance covariates

Author:

Brumberg Katherine1ORCID,Small Dylan S2ORCID,Rosenbaum Paul R2ORCID

Affiliation:

1. Department of Statistics, University of Michigan , Ann Arbor, MI 48109 , United States

2. Department of Statistics and Data Science, University of Pennsylvania , Philadelphia, PA 19104 , United States

Abstract

Abstract What is the best way to split one stratum into two to maximally reduce the within-stratum imbalance in many covariates? We formulate this as an integer program and approximate the solution by randomized rounding of a linear program. A linear program may assign a fraction of a person to each refined stratum. Randomized rounding views fractional people as probabilities, assigning intact people to strata using biased coins. Randomized rounding is a well-studied theoretical technique for approximating the optimal solution of certain insoluble integer programs. When the number of people in a stratum is large relative to the number of covariates, we prove the following new results: (i) randomized rounding to split a stratum does very little randomizing, so it closely resembles the linear programming relaxation without splitting intact people; (ii) the linear relaxation and the randomly rounded solution place lower and upper bounds on the unattainable integer programming solution; and because of (i), these bounds are often close, thereby ratifying the usable randomly rounded solution. We illustrate using an observational study that balanced many covariates by forming matched pairs composed of 2016 patients selected from 5735 using a propensity score. Instead, we form 5 propensity score strata and refine them into 10 strata, obtaining excellent covariate balance while retaining all patients. An R package optrefine at CRAN implements the method. Supplementary materials are available online.

Funder

National Science Foundation Graduate Research Fellowship Program

Publisher

Oxford University Press (OUP)

Reference35 articles.

1. Randomization tests to assess covariate balance when designing and analyzing matched datasets;Branson;Observational Studies,2021

2. Using natural strata when examining unmeasured biases in an observational study of neurological side effects of antibiotics;Brumberg;Applied Statistics,2023

3. Using randomized rounding of linear programs to obtain unweighted natural strata that balance many covariates;Brumberg;Journal of the Royal Statistical Society A,2022

4. Balancing vs modeling approaches to weighting in practice;Chattopadhyay;Statistics in Medicine,2020

5. On the implied weights of linear regression for causal inference;Chattopadhyay;Biometrika,2023

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3