Abstract
AbstractThere is a growing interest in the study of human endogenous retroviruses (HERVs) given the substantial body of evidence that implicates them in many human diseases. Although their genomic characterization presents numerous technical challenges, next-generation sequencing (NGS) has shown potential to detect HERV insertions and their polymorphisms in humans, and a number of computational tools to detect them in short-read NGS data exist. In order to design optimal analysis pipelines, an independent evaluation of the currently available tools is required. We evaluated the performance of a set of such tools using a variety of experimental designs and types of NGS datasets. These included 50 human short read whole-genome sequencing samples, matching long and short read NGS data, and simulated short-read NGS data. Our results highlight the performance variability of the tools across the datasets and suggest that different tools might be suitable for different study designs. Using multiple tools and a consensus approach is advisable if computationally feasible and wet-lab validation via PCR is advisable where biological samples are available.
Publisher
Cold Spring Harbor Laboratory
Cited by
2 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献