Abstract
AbstractMotivationRecent advances in long-read sequencing technologies have significantly facilitated the production of high-quality genome assembly. The telomere-to-telomere (T2T) gapless assembly has become the new golden standard of genome assembly efforts. Several recent efforts have claimed to produce T2T level reference genomes. However, a universal standard is still missing to qualify a genome assembly to be at T2T standard. Traditional genome assembly assessment metrics (N50 and its derivatives) have no capacity in differentiate between nearly T2T assembly and the truly T2T assembly in continuity either globally and locally. Also these metrics are independent of raw reads, which make them inflated easily by artificial operations. Therefore a gaplessness evaluation tool at single nucleotide resolution to reflect true completeness is urgently needed in the era of complete genomes.ResultsHere, we present a tool called Genome Continuity Inspector (GCI) to assess genome assembly continuity at the single base resolution, that can evaluate how close a genome assembly is close to T2T level. GCI utilized multiple aligners to map long reads from multiple platforms back to the assembly. By incorporating curated mapping coverage of high-confidence read alignments, GCI identifies potential assembly issues. Meanwhile, it also reports GCI scores to quantify the assembly overall continuity in the whole genome or chromosome scale.Availability and implementationThe open-source GCI code is freely available on Github (https://github.com/yeeus/GCI) under the MIT license.
Publisher
Cold Spring Harbor Laboratory
Cited by
1 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献