Affiliation:
1. Health Economics and Evaluation Research Program UCLA Center for Health Policy Research Los Angeles California USA
2. Department of Health Policy and Management UCLA Fielding School of Public Health Los Angeles California USA
3. Department of Health Policy and Management University of Maryland School of Public Health College Park Maryland USA
Abstract
AbstractObjectiveTo develop easy to use and validated predictive models to identify beneficiaries experiencing homelessness from administrative data.Data SourcesWe pooled enrollment and claims data from enrollees of the California Whole Person Care (WPC) Medicaid demonstration program that coordinated the care of a subset of Medicaid beneficiaries identified as high utilizers in 26 California counties (25 WPC Pilots). We also used public directories of social service and health care facilities.Study DesignUsing WPC Pilot‐reported homelessness status, we trained seven supervised learning algorithms with different specifications to identify beneficiaries experiencing homelessness. The list of predictors included address‐ and claims‐based indicators, demographics, health status, health care utilization, and county‐level homelessness rate. We then assessed model performance using measures of balanced accuracy (BA), sensitivity, specificity, positive predictive value, negative predictive value, and area under the receiver operating characteristic curve (area under the curve [AUC]).Data Collection/Extraction MethodsWe included 93,656 WPC enrollees from 2017 to 2018, 37,441 of whom had a WPC Pilot‐reported homelessness indicator.Principal FindingsThe random forest algorithm with all available indicators had the best performance (87% BA and 0.95 AUC), but a simpler Generalized Linear Model (GLM) also performed well (74% BA and 0.83 AUC). Reducing predictors to the top 20 and top five most important indicators in a GLM model yields only slightly lower performance (86% BA and 0.94 AUC for the top 20 and 86% BA and 0.91 AUC for the top five).ConclusionsLarge samples can be used to accurately predict homelessness in Medicaid administrative data if a validated homelessness indicator for a small subset can be obtained. In the absence of a validated indicator, the likelihood of homelessness can be calculated using county rate of homelessness, address‐ and claim‐based indicators, and beneficiary age using a prediction model presented here. These approaches are needed given the rising prevalence of homelessness and the focus of Medicaid and other payers on addressing homelessness and its outcomes.
Cited by
1 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献