Abstract
AbstractRegional prevalence estimation requires the use of suitable statistical methods on epidemiologic data with substantial local detail. Small area estimation with medical treatment records as covariates marks a promising combination for this purpose. However, medical routine data often has strong internal correlation due to diagnosis-related grouping in the records. Depending on the strength of the correlation, the space spanned by the covariates can become rank-deficient. In this case, prevalence estimates suffer from unacceptable uncertainty as the individual contributions of the covariates to the model cannot be identified properly. We propose an area-level logit mixed model for regional prevalence estimation with a new fitting algorithm to solve this problem. We extend the Laplace approximation to the log-likelihood by an $$\ell _2$$
ℓ
2
-penalty in order to stabilize the estimation process in the presence of covariate rank-deficiency. Empirical best predictors under the model and a parametric bootstrap for mean squared error estimation are presented. A Monte Carlo simulation study is conducted to evaluate the properties of our methodology in a controlled environment. We further provide an empirical application where the district-level prevalence of multiple sclerosis in Germany is estimated using health insurance records.
Funder
Spanish Grant
Deutsche Forschungsgemeinschaft
Scientic Institute of the German Public Health Insurance Company
Publisher
Springer Science and Business Media LLC
Subject
Statistics, Probability and Uncertainty,Statistics and Probability
Reference49 articles.
1. Akaike H (1974) A new look at the statistical model identification. IEEE Transactions Automatic Control 19(6):716–723
2. AOK Bundesverband (2018) Zahlen und Fakten 2018 mit zusätzlichen Grafiken zur Pflegeversicherung. https://aok-bv.de/imperia/md/aokbv/aok/zahlen/zuf_2018_ppt_final.pdf
3. Berg, EJ (2010) A small area procedure for estimating population counts doctoralthesis, Iowa State University
4. Bergstra J, Bengio Y (2012) Random search for hyper-parameter optimization. J Mach Learn 13:281–305
5. Boubeta M, Lombardía MJ, Morales D (2016) Empirical best prediction under area-level poisson mixed models. TEST 25:548–569
Cited by
3 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献