Author:
Chen Ying,Nie Fan,Xie Shang-Qian,Zheng Ying-Feng,Bray Thomas,Dai Qi,Wang Yao-Xin,Xing Jian-feng,Huang Zhi-Jian,Wang De-Peng,He Li-Juan,Luo Feng,Wang Jian-Xin,Liu Yi-Zhi,Xiao Chuan-Le
Abstract
AbstractAlthough long Nanopore reads are advantageous in de novo genome assembly, applying Nanopore reads in genomic studies is still hindered by their complex errors. Here, we developed NECAT, an error correction and de novo assembly tool designed to overcome complex errors in Nanopore reads. We proposed an adaptive read selection and two-step progressive method to quickly correct Nanopore reads to high accuracy. We introduced a two-stage assembler to utilize the full length of Nanopore reads. NECAT achieves superior performance in both error correction and de novo assembly of Nanopore reads. NECAT requires only 7,225 CPU hours to assemble a 35X coverage human genome and achieves a 2.28-fold improvement in NG50. Furthermore, our assembly of the human WERI cell line showed an NG50 of 29 Mbp. The high-quality assembly of Nanopore reads can significantly reduce false positives in structure variation detection.
Publisher
Cold Spring Harbor Laboratory
Cited by
36 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献