Abstract
Recent technological developments have made genome sequencing and assembly accessible to many groups. However, the presence in sequenced organisms of certain genomic features such as high heterozygosity, polyploidy, aneuploidy, or heterokaryosis can challenge current standard assembly procedures and result in highly fragmented assemblies. Hence, we hypothesized that genome databases must contain a non-negligible fraction of low-quality assemblies that result from such intrinsic genomic factors. Here we present Karyon, a Python-based toolkit that uses raw sequencing data and de novo genome assembly to assess several parameters and generate informative plots to assist in the identification of non-canonical genomic traits. Karyon includes automated de novo genome assembly and variant calling pipelines. We tested Karyon by diagnosing 35 highly fragmented publicly available assemblies from 19 different Mucorales (Fungi) species. Our results show that 6 (17%) of the assemblies presented signs of unusual genomic configurations, suggesting that these are common, at least within the Fungi.
Publisher
Cold Spring Harbor Laboratory