Abstract
AbstractBackgroundMuseum specimens represent an unparalleled record of historical genomic data. However, the wide-spread practice of formalin preservation has thus far impeded genomic analysis of a large proportion of specimens. Limited DNA sequencing from formalin-preserved specimens has yielded low genomic coverage with unpredictable success. We set out to refine sample processing methods and to identify specimen characteristics predictive of sequencing success. With a set of taxonomically diverse specimens collected between 1936 and 2015 and ranging in preservation quality, we compared the efficacy of several end-to-end whole genome sequencing workflows alongside a k-mer-based trimming-free read alignment approach to maximize mapping of endogenous sequence.ResultsWe recovered complete mitochondrial genomes and up to 3X nuclear genome coverage from formalin-fixed tissues. Hot alkaline lysis coupled with phenol-chloroform extraction out- performed proteinase K digestion in recovering DNA, while library preparation method had little impact on sequencing success. The strongest predictor of DNA yield was overall specimen condition, which additively interacts with preservation conditions to accelerate DNA degradation.ConclusionsWe demonstrate a significant advance in capability beyond limited recovery of a small number of loci via PCR or target-capture sequencing. To facilitate strategic selection of suitable specimens for genomic sequencing, we present a decision-making framework that utilizes independent and non-destructive assessment criteria. Sequencing of formalin-fixed specimens will contribute to a greater understanding of temporal trends in genetic adaptation, including those associated with a changing climate. Our work enhances the value of museum collections worldwide by unlocking genomes of specimens that have been disregarded as a valid molecular resource.
Publisher
Cold Spring Harbor Laboratory
Cited by
1 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献