Affiliation:
1. Department of Statistics, University of Illinois at Urbana‐Champaign, Champaign, Illinois 61820, USA
2. Annenberg School for Communication, University of Pennsylvania, Philadelphia, Pennsylvania 19104, USA
Abstract
Public health data, such as new HIV diagnoses, are often left‐censored for confidentiality reasons. Standard analyses that treat censored values as missing at random often yield biased estimates and inferior predictions. Motivated by Philadelphia areal counts of new HIV diagnoses, in which all values less than or equal to 5 are suppressed, we propose two methods to reduce the adverse influence of missingness on the prediction and imputation of areal new HIV diagnoses. One is a likelihood‐based method that integrates the missingness mechanism into the likelihood function; the other is a nonparametric algorithm for matrix factorization imputation. Numerical studies and the Philadelphia data analysis demonstrate that both proposed methods can significantly improve prediction and imputation based on left‐censored HIV data. We also compare the two methods on their robustness to model misspecification and find that both appear robust for prediction, while their imputation performance depends on model specification.
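To illustrate what integrating the suppression rule into the likelihood can look like, here is a minimal sketch. It is not the authors' model: the abstract does not state the count distribution, so a single Poisson rate shared by all areas is assumed purely for illustration, and the names `censored_loglik` and `fit_rate` are hypothetical. A suppressed area contributes P(Y ≤ 5) to the likelihood instead of being dropped.

```python
import math

THRESHOLD = 5  # counts <= 5 are suppressed, as described in the abstract


def poisson_logpmf(k, lam):
    """Log of the Poisson probability mass at k with rate lam."""
    return k * math.log(lam) - lam - math.lgamma(k + 1)


def poisson_logcdf(c, lam):
    """Log of P(Y <= c) for a Poisson variable with rate lam."""
    return math.log(sum(math.exp(poisson_logpmf(k, lam)) for k in range(c + 1)))


def censored_loglik(lam, counts):
    """Log-likelihood where None marks a suppressed (left-censored) count.

    Assumption: every area shares one Poisson rate (illustrative only).
    """
    total = 0.0
    for y in counts:
        if y is None:
            # Suppressed cell: all we know is that the count is <= THRESHOLD.
            total += poisson_logcdf(THRESHOLD, lam)
        else:
            # Reported cell: the exact count is observed.
            total += poisson_logpmf(y, lam)
    return total


def fit_rate(counts, lo=0.01, hi=50.0, steps=5000):
    """Maximize the censored log-likelihood over a simple grid of rates."""
    grid = [lo + i * (hi - lo) / steps for i in range(steps + 1)]
    return max(grid, key=lambda lam: censored_loglik(lam, counts))


if __name__ == "__main__":
    counts = [None, None, 7, 9, None, 6, 8]  # made-up example data
    observed = [y for y in counts if y is not None]
    print("mean of reported counts only:", sum(observed) / len(observed))
    print("censoring-aware rate estimate:", fit_rate(counts))
```

On this made-up data, averaging only the reported counts ignores the fact that the suppressed cells are known to be small, so the censoring-aware estimate comes out lower than the complete-case mean.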
Funder
National Institute of Allergy and Infectious Diseases
National Institute of Mental Health
National Institute on Drug Abuse
Subject
Statistics, Probability and Uncertainty; Statistics and Probability
Cited by
1 article.