Affiliation:
1. Department of Multimedia Engineering, Dongguk University, Seoul, South Korea
Abstract
Next-generation sequencing (NGS) technologies have driven an exponential rise in high-throughput sequencing projects and prompted the development of new read-assembly algorithms. Recent advances in NGS platforms such as Ion Torrent, Illumina, and PacBio have drastically reduced the cost of generating short reads for the genomes of new organisms. Genome research has produced high-quality reference genomes for several organisms, and de novo assembly is a key approach that has facilitated gene discovery and other studies. More powerful analytical algorithms are needed to handle the increasing amount of sequence data. We thoroughly compare de novo assembly algorithms so that new users can clearly understand them: overlap-layout-consensus, de Bruijn graph, string-graph-based assembly, and hybrid approaches. We also address the computational efficacy of each algorithm, the challenges faced by the assembly tools, and the impact of repeats. Our results compare the relative performance of the different assemblers and related assembly differences with and without a reference genome. We hope that this analysis will further the application of de novo sequences and support the future development of assembly algorithms.