Affiliation:
1. Department of Statistics, University of Padova , Padova 35121, Italy
2. Department of Biostatistics, Harvard T.H. Chan School of Public Health , Boston, 02115 MA , United States
Abstract
ABSTRACT
Several epidemiological studies have provided evidence that long-term exposure to fine particulate matter (pm2.5) increases mortality rate. Furthermore, some population characteristics (e.g., age, race, and socioeconomic status) might play a crucial role in understanding vulnerability to air pollution. To inform policy, it is necessary to identify groups of the population that are more or less vulnerable to air pollution. In causal inference literature, the group average treatment effect (GATE) is a distinctive facet of the conditional average treatment effect. This widely employed metric serves to characterize the heterogeneity of a treatment effect based on some population characteristics. In this paper, we introduce a novel Confounder-Dependent Bayesian Mixture Model (CDBMM) to characterize causal effect heterogeneity. More specifically, our method leverages the flexibility of the dependent Dirichlet process to model the distribution of the potential outcomes conditionally to the covariates and the treatment levels, thus enabling us to: (i) identify heterogeneous and mutually exclusive population groups defined by similar GATEs in a data-driven way, and (ii) estimate and characterize the causal effects within each of the identified groups. Through simulations, we demonstrate the effectiveness of our method in uncovering key insights about treatment effects heterogeneity. We apply our method to claims data from Medicare enrollees in Texas. We found six mutually exclusive groups where the causal effects of pm2.5 on mortality rate are heterogeneous.
Publisher
Oxford University Press (OUP)
Reference34 articles.
1. Causal rule ensemble: Interpretable discovery and inference of heterogeneous treatment effects;Bargagli-Stoffi,2020
2. Heterogeneous causal effects with imperfect compliance: a Bayesian machine learning approach;Bargagli-Stoffi;The Annals of Applied Statistics,2022
3. Bayesian cluster analysis;Binder;Biometrika,1978
4. Cart: Classification and regression trees;Breiman;Chapman and Hall/CRC, New York, New York,1984
5. In pursuit of evidence in air pollution epidemiology: the role of causally driven data science;Carone;Epidemiology,2020