Abstract
AbstractLong-read sequencing enables variant detection in genomic regions that are considered difficult-to-map by short-read sequencing. To fully exploit the benefits of longer reads, here we present a deep-learning method NanoCaller, which detects SNPs using long-range haplotype information, then phases long reads with called SNPs and calls indels with local realignment. Evaluation on 8 human genomes demonstrated that NanoCaller generally achieves better performance than competing approaches. We experimentally validated 41 novel variants in a widely-used benchmarking genome, which cannot be reliably detected previously. In summary, NanoCaller facilitates the discovery of novel variants in complex genomic regions from long- read sequencing.
Publisher
Cold Spring Harbor Laboratory
Cited by
3 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献