A Within-Group Approach to Ensemble Machine Learning Methods for Causal Inference in Multilevel Studies-Reference-Cited by-同舟云学术

A Within-Group Approach to Ensemble Machine Learning Methods for Causal Inference in Multilevel Studies

Published:2023-04-25 Issue:1 Volume:49 Page:61-91
ISSN:1076-9986
Container-title:Journal of Educational and Behavioral Statistics
language:en
Short-container-title:Journal of Educational and Behavioral Statistics

Author:

Suk Youmi¹^ORCID

Affiliation:

1. Teachers College, Columbia University

Abstract

Machine learning (ML) methods for causal inference have gained popularity due to their flexibility to predict the outcome model and the propensity score. In this article, we provide a within-group approach for ML-based causal inference methods in order to robustly estimate average treatment effects in multilevel studies when there is cluster-level unmeasured confounding. We focus on one particular ML-based causal inference method based on the targeted maximum likelihood estimation (TMLE) with an ensemble learner called SuperLearner. Through our simulation studies, we observe that training TMLE within groups of similar clusters helps remove bias from cluster-level unmeasured confounders. Also, using within-group propensity scores estimated from fixed effects logistic regression increases the robustness of the proposed within-group TMLE method. Even if the propensity scores are partially misspecified, the within-group TMLE still produces robust ATE estimates due to double robustness with flexible modeling, unlike parametric-based inverse propensity weighting methods. We demonstrate our proposed methods and conduct sensitivity analyses against the number of groups and individual-level unmeasured confounding to evaluate the effect of taking an eighth-grade algebra course on math achievement in the Early Childhood Longitudinal Study.

Funder

National Science Foundation

Publisher

American Educational Research Association (AERA)

Subject

Social Sciences (miscellaneous),Education

Link

http://journals.sagepub.com/doi/pdf/10.3102/10769986231162096

Reference57 articles.

1. Anderson R., Chang B. (2011). Mathematics course-taking in rural high schools. Journal of Research in Rural Education, 26(1), 1–10. http://sites.psu.edu/jrre/wp-content/uploads/sites/6347/2014/02/26-1.pdf

2. The Role of the Propensity Score in Fixed Effect Models

3. Propensity score matching with clustered data. An application to the estimation of the impact of caesarean section on the Apgar score

4. The specification of the propensity score in multilevel observational studies

5. Recursive partitioning for heterogeneous causal effects

Cited by 2 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Designing Optimal, Data-Driven Policies from Multisite Randomized Trials;Psychometrika;2023-10-24

2. Parametric and nonparametric propensity score estimation in multilevel observational studies;Statistics in Medicine;2023-08-02