Author:
Cheng Zhang,Xia Weibo,McKelvey Sean,He Qiang,Chen Yuzhou,Yuan Heyang
Abstract
AbstractModeling microbial communities can provide predictive insights into microbial ecology, but current modeling approaches suffer from inherent limitations. In this study, a novel modeling approach was proposed to address those limitations based on the intrinsic connection between the growth kinetics of guilds and the dynamics of individual microbial populations. To implement the modeling approach, 466 samples from four full-scale activated sludge systems were retrieved from the literature. The raw samples were processed using a data transformation method that not only increased the dataset size by three times but also enabled quantification of population dynamics. Most of the 42 family-level core populations showed overall dynamics close to zero within the sampling period, explaining their resilience to environmental perturbation. Bayesian networks built with environmental factors, perturbation, historical abundance, population dynamics, and mechanistically derived microbial kinetic parameters classified the core populations into heterotrophic and autotrophic guilds. Topological data analysis was applied to identify keystone populations and their time-dependent interactions with other populations. The data-driven inferences were validated directly using the Microbial Database for Activated Sludge (MiDAS) and indirectly by predicting population abundance and community structure using artificial neural networks. The Bray-Curtis similarity between predicted and observed communities was significantly higher with microbial kinetic parameters than without parameters (0.70 vs. 0.66), demonstrating the accuracy of the modeling approach. Implemented based on engineered systems, this modeling approach can be generalized to natural systems to gain predictive understandings of microbial ecology.
Publisher
Cold Spring Harbor Laboratory
Reference97 articles.
1. Impact of dataset size on classification performance: an empirical evaluation in the medical domain;Applied Sciences,2021
2. Anguita, D. , Ghelardoni, L. , Ghio, A. , Oneto, L. and Ridella, S . (2012) The ‘K’ in K-fold Cross Validation. ESANN 2012 proceedings, European Symposium on Artificial Neural Networks, Computational Intelligence and Machine Learning, 25–27.
3. An efficient and scalable top-down method for predicting structures of microbial communities
4. Baas, N.A. , Carlsson, G.E. , Quick, G. , Szymik, M. and Thaule, M . (2020) Topological Data Analysis, Springer.
5. The IWA Anaerobic Digestion Model No 1 (ADM1)