Abstract
AbstractAn important assessment prior to genome assembly and related analyses is genome profiling, where the k-mer frequencies within raw sequencing reads are analyzed to estimate major genome characteristics such as size, heterozygosity, and repetitiveness. Here we introduce GenomeScope 2.0 (https://github.com/tbenavi1/genomescope2.0), which applies combinatorial theory to establish a detailed mathematical model of how k-mer frequencies are distributed in heterozygous and polyploid genomes. We describe and evaluate a practical implementation of the polyploid-aware mixture model that quickly and accurately infers genome properties across thousands of simulated and several real datasets spanning a broad range of complexity. We also present a method called Smudgeplot (https://github.com/KamilSJaron/smudgeplot) to visualize and estimate the ploidy and genome structure of a genome by analyzing heterozygous k-mer pairs. We successfully apply the approach to systems of known variable ploidy levels in the Meloidogyne genus and the extreme case of octoploid Fragaria × ananassa.
Publisher
Springer Science and Business Media LLC
Subject
General Physics and Astronomy,General Biochemistry, Genetics and Molecular Biology,General Chemistry
Reference36 articles.
1. Meyers, L. A. & Levin, D. A. On the abundance of polyploids in flowering plants. Evolution 60, 1198–1206 (2006).
2. Renny-Byfield, S. & Wendel, J. F. Doubling down on genomes: polyploidy and crop plants. Am. J. Bot. 101, 1711–1725 (2014).
3. Todd, R. T., Forche, A. & Selmecki, A. Ploidy variation in fungi: polyploidy, aneuploidy, and genome evolution. Microbiol. Spectr. 5, 1–20 (2017).
4. Novikova, P. Y. et al. Whole genome duplication potentiates inter-specific hybridisation and niche shifts in australian burrowing frogs neobatrachus. https://www.biorxiv.org/content/10.1101/593699v1 (2019).
5. Le Comber, S. C. & Smith, C. Polyploidy in fishes: patterns and processes. Biol. J. Linn. Soc. 82, 431–442 (2004).
Cited by
890 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献