Variational inference for microbiome survey data with application to global ocean data

Author:

Mishra Aditya,McNichol Jesse,Fuhrman Jed,Blei David,Müller Christian L.ORCID

Abstract

AbstractLinking sequence-derived microbial taxa abundances to host (patho-)physiology or habitat characteristics in a reproducible and interpretable manner has remained a formidable challenge for the analysis of microbiome survey data. Here, we introduce a flexible probabilistic modeling framework, VI-MIDAS (VariationalInference forMIcrobiome surveyDAta analysiS), that enablesjointestimation of context-dependent drivers and broad patterns of associations of microbial taxon abundances from microbiome survey data. VI-MIDAS comprises mechanisms for direct coupling of taxon abundances with covariates and taxa-specific latent coupling which can incorporate spatio-temporal informationandtaxon-taxon interactions. We leverage mean-field variational inference for posterior VI-MIDAS model parameter estimation and illustrate model building and analysis using Tara Ocean Expedition survey data. Using VI-MIDAS’ latent embedding model and tools from network analysis, we show that marine microbial communities can be broadly categorized into five modules, including SAR11-, Nitrosopumilus-, and Alteromondales-dominated communities, each associated with specific environmental and spatiotemporal signatures. VI-MIDAS also finds evidence for largely positive taxon-taxon associations in SAR11 or Rhodospirillales clades, and negative associations with Alteromonadales and Flavobacteriales classes. Our results indicate that VI-MIDAS provides a powerful integrative statistical analysis framework for discovering broad patterns of associations between microbial taxa and context-specific covariate data from microbiome survey data.

Publisher

Cold Spring Harbor Laboratory

Reference80 articles.

1. The Integrative Human Microbiome Project

2. J. (John) Aitchison . The statistical analysis of compositional data. Blackburn Press, Caldwell, N.J., 2003.

3. Oxygen modulates bacterial community composition in the coastal upwelling waters off central chile;Deep Sea Research Part II: Topical Studies in Oceanography,2018

4. Simons collaborative marine atlas project (simons cmap): An open-source portal to share, visualize, and analyze ocean data;Limnology and Oceanography: Methods,2021

5. A glm-based latent variable ordination method for microbiome samples;Biometrics,2018

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3