Author:
Harvey Guillermina B.,Boggio Gabriela S.
Abstract
The presence of a large number of zero counts is quite common in studies involving count data. This causes overdispersion. Therefore, different types of models have been proposed as alternatives and a very frequent practice is to use the negative binomial model. In 2018, Bonat (2018) considered a new type of model, based on the Poisson-Tweedie dispersion models, hich can automatically adapt to different degrees of overdispersion in count data. This article presents a simulation study in order to compare the estimates derived from the Poisson- eedie model for a wide range of overdispersed data with estimates derived from the egative binomial model. In both models, the relative percent bias of the estimated coeffcients was very small. Nevertheless, the Poisson-Tweedie model showed a better performance with smaller values for the mean squared errors, particularly in scenarios with more dispersion. Hence, it would be possible to suggest the data analyst in which situations it would be enough to work with the popular negative binomial model or when it would be best to use the Poisson-Tweedie family. Additionally, the comparison between the fit of the negative binomial mode land that of the Poisson-Tweedie family is illustrated by analysing the number fpediatric consultations of a group of children who receive health care in a public health center in Rosario, Argentina. Although the results obtained in both models were similar, the estimates in the Poisson-Tweedie model were more accurate.
Publisher
Universidad Nacional de Colombia
Reference23 articles.
1. Agresti, A. (2015), Foundations of linear and generalized linear models, 1 edn, John Wiley & Sons.
2. Berger, M. & Tutz, G. (2020), Transition Models for Count Data: a Flexible Alternative to Fixed Distribution Models. arXiv preprint. arXiv:2003.12411
3. Bonat, W. H. (2016), mcglm: Multivariate covariance generalized linear models. R package version 0.3.0. https://github.com/wbonat/mcglm
4. Bonat, W. H. (2018), 'Multiple response variables regression models in R: The mcglm package', Journal of Statistical Software 84(4). https://doi.org/10.18637/jss.v084.i04
5. Bonat, W. H. & Jørgensen, B. (2016), 'Multivariate covariance generalized linear models', Journal of the Royal Statistical Society: Series C (Applied Statistics) 65(5), 649-675. https://doi.org/10.1111/rssc.12145