Abstract
AbstractThe promise of biotechnology is tempered by its potential for accidental or deliberate misuse. Reliably identifying telltale signatures characteristic to different genetic designers, termed ‘genetic engineering attribution’, would deter misuse, yet is still considered unsolved. Here, we show that recurrent neural networks trained on DNA motifs and basic phenotype data can reach 70% attribution accuracy in distinguishing between over 1,300 labs. To make these models usable in practice, we introduce a framework for weighing predictions against other investigative evidence using calibration, and bring our model to within 1.6% of perfect calibration. Additionally, we demonstrate that simple models can accurately predict both the nation-state-of-origin and ancestor labs, forming the foundation of an integrated attribution toolkit which should promote responsible innovation and international security alike.
Funder
U.S. Department of Health & Human Services | National Institutes of Health
Publisher
Springer Science and Business Media LLC
Subject
General Physics and Astronomy,General Biochemistry, Genetics and Molecular Biology,General Chemistry
Reference59 articles.
1. Engelberg, S. New evidence adds doubt to FBI’s case against anthrax suspect—ProPublica. ProPublica https://www.propublica.org/article/new-evidence-disputes-case-against-bruce-e-ivins (2011).
2. Skane, W. Science alone does not establish source of anthrax used in 2001 mailings. http://www8.nationalacademies.org/onpinews/newsitem.aspx?RecordID=13098 (2011).
3. Cummings, C. A. & Relman, D. A. Microbial forensics-‘cross-examining pathogens’. Science 296, 1976–1979 (2002).
4. Budowle, B. et al. Building microbial forensics as a response to bioterrorism. Science 301, 1852–1853 (2003).
5. Shane, S. & Wade, N. Pressure grows for F.B.I.’s anthrax evidence. NY Times (2008).
Cited by
16 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献