Real-time search of all bacterial and viral genomic data-Reference-Cited by-同舟云学术

Real-time search of all bacterial and viral genomic data

Published:2017-12-15 Issue: Volume: Page:
ISSN:
Container-title:
language:
Short-container-title:

Author:

Bradley Phelim^ORCID,Den Bakker Henk C,Rocha Eduardo P. C.^ORCID,McVean Gil^ORCID,Iqbal Zamin^ORCID

Abstract

AbstractGenome sequencing of pathogens is now ubiquitous in microbiology, and the sequence archives are effectively no longer searchable for arbitrary sequences. Furthermore, the exponential increase of these archives is likely to be further spurred by automated diagnostics. To unlock their use for scientific research and real-time surveillance we have combined knowledge about bacterial genetic variation with ideas used in web-search, to build a DNA search engine for microbial data that can grow incrementally. We indexed the complete global corpus of bacterial and viral whole genome sequence data (447,833 genomes), using four orders of magnitude less storage than previous methods. The method allows future scaling to millions of genomes. This renders the global archive accessible to sequence search, which we demonstrate with three applications: ultra-fast search for resistance genes MCR1-3, analysis of host-range for 2827 plasmids, and quantification of the rise of antibiotic resistance prevalence in the sequence archives.

Publisher

Cold Spring Harbor Laboratory

Reference46 articles.

1. Rapid antibiotic-resistance predictions from genome sequence data for Staphylococcus aureus and Mycobacterium tuberculosis

2. Rapid Whole-Genome Sequencing of Mycobacterium tuberculosis Isolates Directly from Clinical Samples

3. Real-time, portable genome sequencing for Ebola surveillance

4. Identification of bacterial pathogens and antimicrobial resistance directly from clinical urines by nanopore-based metagenomic sequencing

5. Same-Day Diagnostic and Surveillance Data for Tuberculosis via Whole-Genome Sequencing of Direct Respiratory Samples

Cited by 18 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Performances of bioinformatics tools for the analysis of sequencing data of Mycobacterium tuberculosis complex strains;2022-07-06

2. Building large updatable colored de Bruijn graphs via merging;Bioinformatics;2019-07

3. Salmonella Genomic Island 1B Variant Found in a Sequence Type 117 Avian Pathogenic Escherichia coli Isolate;mSphere;2019-06-26

4. Genomic Investigation of the Emergence of Invasive Multidrug-Resistant Salmonella enterica Serovar Dublin in Humans and Animals in Canada;Antimicrobial Agents and Chemotherapy;2019-06

5. Assessing evolutionary risks of resistance for new antimicrobial therapies;Nature Ecology & Evolution;2019-03-18