Abstract
AbstractSummaryMetagenome Assembled Genomes (MAGs) are often incomplete, with sequences missing due to errors in assembly or low coverage. Incomplete MAGs present a particular challenge for identification of shared genes within a microbial population, known as core genes, as a core gene missing in only a few assemblies will result in it being mischaracterized at a lower frequency. Here, we present CELEBRIMBOR, a snakemake pangenome analysis pipeline which uses a measure of genome completeness to automatically adjust the frequency threshold at which core genes are identified, enabling accurate core gene identification in MAGs.Availability and implementationCELEBRIMBOR is published under open source Apache 2.0 licence athttps://github.com/bacpop/CELEBRIMBORand is available as a Docker container. Supplementary material is available in the online version of the article.
Publisher
Cold Spring Harbor Laboratory