Abstract
The COVID-19 pandemic has emphasized the importance of detecting known and emerging pathogens from clinical and environmental samples. However, robust characterization of pathogenic sequences remains an open challenge. To this end, we developed SeqScreen, which can accurately characterize short nucleotide sequences using taxonomic and functional labels, and a customized set of curated Functions of Sequences of Concern (FunSoCs) specific to microbial pathogenesis. We show our ensemble machine learning model can label protein-coding sequences with FunSoCs with high recall and precision. SeqScreen is a step towards a novel paradigm of functionally informed pathogen characterization and is available for download at: www.gitlab.com/treangenlab/seqscreen
Publisher
Cold Spring Harbor Laboratory
Cited by
6 articles.