Author:
McGregor Kevin,Labbe Aurélie,Greenwood Celia MT,Parsons Todd,Quince Christopher
Abstract
AbstractBackgroundThe human microbiome comprises the microorganisms that inhabit the various locales of the human body and plays a vital role in human health. The composition of a microbial population is often quantified through measures of species diversity, which summarize the number of species along with their relative abundances into a single value. In a microbiome sample there will certainly be species missing from the target population which will affect the diversity estimates.MethodsWe employ a model based on the hierarchical Pitman-Yor (HPY) process to model the species abundance distributions over multiple microbiome populations. The model parameters are estimated using a Gibbs sampler. We also derive estimates of species diversity as a function of the HPY parameters.ResultsWe show that the Gibbs sampler for the HPY model performs well in the simulation study. We also show that the estimates of diversity from the HPY model improve over naïve estimates when species are missing. Similarly the HPY estimates tend to perform better than the naïve estimates when the number of individuals sampled from a population is small.
Publisher
Cold Spring Harbor Laboratory
Reference24 articles.
1. Bayesian nonparametric dependent model for partially replicated data: the influence of fuel spills on species diversity;The Annals of Applied Statistics,2016
2. Multi-armed bandit for species discovery: a bayesian nonparametric approach;Journal of the American Statistical Association,2018
3. Buntine, W. (2012), ‘Hyper-parameter estimation for the dirichlet prior’. 11, 12, 27, 28, 29
4. A bayesian view of the poisson-dirichlet process;arXiv preprint,2010
5. Cerquetti, A. (2015), Bayesian estimation of gini-simpson’s index under mainland-island community structure, in ‘Convegno della Società Italiana di Statistica’, Springer, pp. 61–70. 7
Cited by
2 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献