Abstract
AbstractMotivationStrain-level analysis of metagenomic data has garnered significant interest in recent years. Microbial single nucleotide polymorphisms (SNPs) are genomic variants that can reflect strain-level differences within a microbial species. The diversity and emergence of SNPs in microbial genomes may reveal evolutionary history and environmental adaptation in microbial populations. However, efficient discovery of shared polymorphic variants in a large collection metagenomic samples remains a computational challenge.ResultsMetaQuad employs a density-based clustering technique to efficiently differentiate shared variants from non-polymorphic sites using shotgun metagenomic data. Empirical comparisons with other state-of-the-art methods show that MetaQuad significantly reduces the number of false-positive SNPs without greatly affecting the true-positive rate. We used MetaQuad to identify antibiotic-associated variants in patients who underwentHelicobacter pylorieradication therapy. MetaQuad detected 7,591 variants across 529 antibiotic resistance genes. The nucleotide diversity of some genes is increased six weeks after antibiotic treatment, potentially indicating the role of these genes in specific antibiotic treatments.AvailabilityMetaQuad is an open-source Python package available viahttps://github.com/holab-hku/MetaQuad.Contactjwkho@hku.hkSupplementary informationSupplementary data are available atXXXXonline.
Publisher
Cold Spring Harbor Laboratory