Handling missing data when estimating causal effects with targeted maximum likelihood estimation-Reference-Cited by-同舟云学术

Handling missing data when estimating causal effects with targeted maximum likelihood estimation

Published:2024-02-22 Issue:7 Volume:193 Page:1019-1030
ISSN:0002-9262
Container-title:American Journal of Epidemiology
language:en
Short-container-title:

Author:

Dashti S Ghazaleh^ORCID,Lee Katherine J^ORCID,Simpson Julie A^ORCID,White Ian R^ORCID,Carlin John B^ORCID,Moreno-Betancur Margarita^ORCID

Abstract

Abstract Targeted maximum likelihood estimation (TMLE) is increasingly used for doubly robust causal inference, but how missing data should be handled when using TMLE with data-adaptive approaches is unclear. Based on data (1992-1998) from the Victorian Adolescent Health Cohort Study, we conducted a simulation study to evaluate 8 missing-data methods in this context: complete-case analysis, extended TMLE incorporating an outcome-missingness model, the missing covariate missing indicator method, and 5 multiple imputation (MI) approaches using parametric or machine-learning models. We considered 6 scenarios that varied in terms of exposure/outcome generation models (presence of confounder-confounder interactions) and missingness mechanisms (whether outcome influenced missingness in other variables and presence of interaction/nonlinear terms in missingness models). Complete-case analysis and extended TMLE had small biases when outcome did not influence missingness in other variables. Parametric MI without interactions had large bias when exposure/outcome generation models included interactions. Parametric MI including interactions performed best in bias and variance reduction across all settings, except when missingness models included a nonlinear term. When choosing a method for handling missing data in the context of TMLE, researchers must consider the missingness mechanism and, for MI, compatibility with the analysis method. In many settings, a parametric MI approach that incorporates interactions and nonlinearities is expected to perform well.

Funder

Operational Infrastructure Support Program

National Health and Medical Research Council

Medical Research Council

Publisher

Oxford University Press (OUP)

Link

https://academic.oup.com/aje/advance-article-pdf/doi/10.1093/aje/kwae012/56745132/kwae012.pdf

Reference40 articles.

1. Confounding and collapsibility in causal inference;Greenland;Stat Sci.,1999

2. A definition of causal effect for epidemiological research;Hernan;J Epidemiol Community Health.,2004

3. Using big data to emulate a target trial when a randomized trial is not available;Hernan;Am J Epidemiol.,2016

4. Estimating causal effects of treatments in randomized and nonrandomized studies;Rubin;J Educ Psychol.,1974

5. Causal inference using potential outcomes: design, modeling, decisions;Rubin;J Am Stat Assoc.,2005