Abstract
AbstractLong read sequencing data, particularly those derived from the Oxford Nanopore (ONT) sequencing platform, tend to exhibit a high error rate. Here, we present NextDenovo, a highly efficient error correction and assembly tool for noisy long reads, which achieves a high level of accuracy in genome assembly. NextDenovo can rapidly correct reads; these corrected reads contain fewer errors than other comparable tools and are characterized by fewer chimeric alignments. We applied NextDenovo to the assembly of high quality reference genomes of 35 diverse humans from across the world using ONT Nanopore long read sequencing data. Based on thesede novogenome assemblies, we were able to identify the landscape of segmental duplications and gene copy number variation in the modern human population. The use of the NextDenovo program should pave the way for population-scale long-read assembly, thereby facilitating the construction of human pan-genomes, using Nanopore long read sequencing data.
Publisher
Cold Spring Harbor Laboratory
Cited by
84 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献