Coassembly and binning of a twenty-year metagenomic time-series from Lake Mendota
-
Published:2024-09-04
Issue:1
Volume:11
Page:
-
ISSN:2052-4463
-
Container-title:Scientific Data
-
language:en
-
Short-container-title:Sci Data
Author:
Oliver Tiffany, Varghese Neha, Roux Simon, Schulz Frederik, Huntemann MarcelORCID, Clum Alicia, Foster Brian, Foster Bryce, Riley Robert, LaButti Kurt, Egan Robert, Hajek Patrick, Mukherjee SupratimORCID, Ovchinnikova Galina, Reddy T. B. K.ORCID, Calhoun Sara, Hayes Richard D.ORCID, Rohwer Robin R.ORCID, Zhou ZhichaoORCID, Daum ChrisORCID, Copeland AlexORCID, Chen I-Min A., Ivanova Natalia N.ORCID, Kyrpides Nikos C.ORCID, Mouncey Nigel J., del Rio Tijana Glavina, Grigoriev Igor V.ORCID, Hofmeyr Steven, Oliker Leonid, Yelick Katherine, Anantharaman KarthikORCID, McMahon Katherine D., Woyke TanjaORCID, Eloe-Fadrosh Emiley A.ORCID
Abstract
AbstractThe North Temperate Lakes Long-Term Ecological Research (NTL-LTER) program has been extensively used to improve understanding of how aquatic ecosystems respond to environmental stressors, climate fluctuations, and human activities. Here, we report on the metagenomes of samples collected between 2000 and 2019 from Lake Mendota, a freshwater eutrophic lake within the NTL-LTER site. We utilized the distributed metagenome assembler MetaHipMer to coassemble over 10 terabases (Tbp) of data from 471 individual Illumina-sequenced metagenomes. A total of 95,523,664 contigs were assembled and binned to generate 1,894 non-redundant metagenome-assembled genomes (MAGs) with ≥50% completeness and ≤10% contamination. Phylogenomic analysis revealed that the MAGs were nearly exclusively bacterial, dominated by Pseudomonadota (Proteobacteria, N = 623) and Bacteroidota (N = 321). Nine eukaryotic MAGs were identified by eukCC with six assigned to the phylum Chlorophyta. Additionally, 6,350 high-quality viral sequences were identified by geNomad with the majority classified in the phylum Uroviricota. This expansive coassembled metagenomic dataset provides an unprecedented foundation to advance understanding of microbial communities in freshwater ecosystems and explore temporal ecosystem dynamics.
Funder
NSF | BIO | Division of Environmental Biology NSF | BIO | Division of Molecular and Cellular Biosciences United States Department of Agriculture | Agricultural Research Service NSF | BIO | Division of Biological Infrastructure U.S. Department of Energy
Publisher
Springer Science and Business Media LLC
Reference19 articles.
1. Gries, C., Gahler, M. R., Hanson, P. C., Kratz, T. K. & Stanley, E. H. Information management at the North Temperate Lakes Long-term Ecological Research site — Successful support of research in a large, diverse, and long running project. Ecol. Inform. 36, 201–208 (2016). 2. Rohwer, R. R., Hale, R. J., Vander Zanden, M. J., Miller, T. R. & McMahon, K. D. Species invasions shift microbial phenology in a two-decade freshwater time series. Proc. Natl. Acad. Sci. USA 120, e2211796120 (2023). 3. DOE Joint Genome Institute. Freshwater microbial communities from Lake Mendota, Crystal Bog Lake, and Trout Bog Lake in Wisconsin, United States - time-series metagenomes. Genbank. https://identifiers.org/ncbi/bioproject:PRJNA1056043 (2023). 4. DOE Joint Genome Institute. Combined assembly of metagenomes from Lake Mendota. Genbank. https://identifiers.org/ncbi/bioproject:PRJNA1134257 (2024). 5. Hofmeyr, S. et al. Terabase-scale metagenome coassembly with MetaHipMer. Sci. Rep. 10, 10689 (2020).
|
|