Affiliation:
1. Illinois State University, Normal, USA and PA Lachenbruch FDA Center for Biologics Research and Review, Rockville, MD, USA,
Abstract
Consider repeated measures data with many zeros. For the case with one grouping factor and one repeated measure, we examine several models, assuming that the nonzero data are roughly lognormal. One of the simplest approaches is to model the zeros as left-censored observations from the lognormal distribution. A random effectis assumed for subjects. The censored model makes a strong assumption about the relationship between the zeros and the nonzero values. To check on this, you can instead assume that some of the zeros are ‘true’ zeros and model them as Bernoulli. Then the other values are modeled with a censored lognormal. A logistic model is used for the Bernoulli p, the probability of a true nonzero. The fit of the pure left-censored lognormal can be assessed by testing the hypothesis that p is 1, as described by Moulton and Halsey.1 The model can also be simplified by omitting the censoring, leaving a logistic model for the zeros and a lognormal model for the nonzero values. This is approximately equivalent to modeling the zero and nonzero values separately, a two-part model. In contrast to the censored model, this model assumes only a slight relationship (a covariance component) between the occurrence of zeros and the size of the nonzero values. The models are compared in terms of an example with data from children’s private speech.
Subject
Health Information Management,Statistics and Probability,Epidemiology
Cited by
72 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献