Abstract
MotivationDespite the recent progress in genome sequencing and assembly, many of the currently available assembled genomes come in a draft form. Such draft genomes consist of a large number of genomic fragments (scaffolds), whose positions and orientations along the genome are unknown. While there exists a number of methods for reconstruction of the genome from its scaffolds, utilizing various computational and wet-lab techniques, they often can produce only partial error-prone scaffold assemblies. It therefore becomes important to compare and merge scaffold assemblies produced by different methods, thus combining their advantages and highlighting present conflicts for further investigation. These tasks may be labor intensive if performed manually.ResultsWe present CAMSA—a tool for comparative analysis and merging of two or more given scaffold assemblies. The tool (i) creates an extensive report with several comparative quality metrics; (ii) constructs the most confident merged scaffold assembly; and (iii) provides an interactive framework for a visual comparative analysis of the given assemblies. Among the CAMSA features, only scaffold merging can be evaluated in comparison to existing methods. Namely, it resembles the functionality of assembly reconciliation tools, although their primary targets are somewhat different. Our evaluations show that CAMSA produces merged assemblies of comparable or better quality than existing assembly reconciliation tools while being the fastest in terms of the total running time.AvailabilityCAMSA is distributed under the MIT license and is available at http://cblab.org/camsa/.
Publisher
Cold Spring Harbor Laboratory
Reference48 articles.
1. Sergey Aganezov and Max A. Alekseyev . Multi-Genome scaffold Co-Assembly Based on the Analysis of Gene Orders and Genomic Repeats. In A. Bourgeois et al., editors, Proceedings of the 12th International Symposium on Bioinformatics Research and Applications (ISBRA), volume 9683 of Lecture Notes in Computer Science, pages 237–249, 2016.
2. Uncovering the novel characteristics of Asian honey bee, Apis cerana, by whole genome sequencing
3. L Assour and S Emrich . Multi-genome synteny for assembly improvement. In Proceedings of 7th International Conference on Bioinformatics and Computational Biology, pages 193–199, 2015.
4. Reconstruction of ancestral genomes in presence of gene gain and loss;Journal of Computational Biology,2016
5. SPAdes: A New Genome Assembly Algorithm and Its Applications to Single-Cell Sequencing
Cited by
1 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献