Abstract
AbstractBackgroundWhite clover (Trifolium repensL.; Fabaceae) is an important forage and cover crop in agricultural pastures around the world, and is increasingly used in evolutionary ecology and genetics to understand the genetic basis of adaptation. Historically, improvements in white clover breeding practices and assessments of genetic variation in nature have been hampered by a lack of high-quality genomic resources for this species, owing in part to its high heterozygosity and allotetraploid hybrid origin.FindingsHere, we use PacBio HiFi and chromosome conformation capture (Omni-C) technologies to generate a chromosome-level, haplotype-resolved genome assembly for white clover totaling 998 Mbp (scaffold N50 = 59.3 Mbp) and 1 Gbp (scaffold N50 = 58.6 Mbp) for haplotypes 1 and 2, respectively, with each haplotype arranged into 16 chromosomes (8 per subgenome). We additionally provide a functionally annotated haploid mapping assembly (968 Mbp, scaffold N50 = 59.9 Mbp), which drastically improves on the existing reference assembly in both contiguity and assembly accuracy. We annotated 78,174 protein-coding genes, resulting in protein BUSCO completeness scores of 99.6% and 99.3% against the embryophyta_odb10 and fabales_odb10 lineage datasets, respectively.ConclusionsWe provide two white clover genome assemblies as part of this project: (1) a haplotype-resolved, chromosome-level assembly, and (2) a functionally annotated haploid mapping assembly. These assemblies place white clover among the best sequenced legumes to date, and one of the best assemblies for a plant of recent polyploid origins. This work promises to facilitate ongoing and future work in agricultural and evolutionary genetics in this agronomically and ecologically important species.
Publisher
Cold Spring Harbor Laboratory