Abstract
The use of culture independent molecular methods, often referred to as metagenomics, have revolutionized the ability to explore and characterize microbial communities from diverse environmental sources. Most metagenomic workflows have been developed for identification of prokaryotic and eukaryotic community constituents, but tools for identification of plastid genomes are lacking. The endosymbiotic origin of plastids also poses challenges where plastid metagenomic assembled genomes (MAGs) may be misidentified as low-quality bacterial MAGs. Current tools are limited to classification of contigs as plastid and do not provide further assessment or characterization of plastid MAGs. plastiC is a workflow that allows users to identify plastid genomes in metagenome assemblies, assess completeness, and predict taxonomic association from diverse environmental sources. plastiC is a Snakemake workflow available at https://github.com/Finn-Lab/plastiC. We demonstrate the utility of this workflow with the successful recover of algal plastid MAGs from publicly available lichen metagenomes.
Funder
European Molecular Biology Laboratory
Wellcome
Subject
General Biochemistry, Genetics and Molecular Biology,Medicine (miscellaneous)
Reference17 articles.
1. UniProt: the universal protein knowledgebase in 2021.;A Bateman;Nucleic Acids Res.,2021
2. Sensitive protein alignments at tree-of-life scale using DIAMOND.;B Buchfink;Nat Methods.,2021
3. Finn-Lab/plastiC: Initial Release of plastiC - Archivable (v0.1.1).;E Cameron;Zenodo.,2023
4. CheckM2: a rapid, scalable and accurate tool for assessing microbial genome quality using 1 machine learning.;A Chklovski;bioRxiv.,2022
5. HMMER: Biosequence analysis using profile hidden Markov models;S Eddy,2022