Abstract
AbstractConiferous trees in gymnosperm are an important source of wood production. Because of their long lifecycle, the breeding programs of coniferous tree are time- and labor-consuming. Genomics could accelerate the selection of superior trees or clones in the breeding programs; however, the genomes of coniferous trees are generally giant in size and exhibit high heterozygosity. Therefore, the generation of long contiguous genome assemblies of coniferous species has been difficult. In this study, we optimized the DNA library preparation protocols and employed high-fidelity (HiFi) long-read sequencing technology to sequence and assemble the genomes of four coniferous tree species,Larix kaempferi, Chamaecyparis obtusa, Cryptomeria japonica, andCunninghamia lanceolata. Genome assemblies of the four species totaled 13.5 Gb (L. kaempferi), 8.5 Gb (C. obtusa), 9.2 Gb (C. japonica), and 11.7 Gb (C. lanceolata), which covered 99.6% of the estimated genome sizes on average. The contig N50 value, which indicates assembly contiguity, ranged from 1.2 Mb inC. obtusato 16.0 Mb inL. kaempferi, and the assembled sequences contained, on average, 89.2% of the single-copy orthologs conserved in embryophytes. Assembled sequences representing alternative haplotypes covered 70.3–95.1% of the genomes, suggesting that the four coniferous tree genomes exhibit high heterozygosity levels. The genome sequence information obtained in this study represents a milestone in tree genetics and genomics, and will facilitate gene discovery, allele mining, phylogenetics, and evolutionary studies in coniferous trees, and accelerate forest tree breeding programs.
Publisher
Cold Spring Harbor Laboratory
Reference22 articles.
1. Genome Editing in Trees: From Multiple Repair Pathways to Long-Term Stability;Front Plant Sci,2018
2. Burdon RD , Wilcox PL (2011) Integration of Molecular Markers in Breeding. In: Plomion C , Bousquet J , Kole C (eds) Genetics, Genomics and Breeding of Conifers. CRC Press, p 47
3. Phylogenetics of Seed Plants: An Analysis of Nucleotide Sequences from the Plastid Gene rbcL
4. FAO (2020) Global forest resources assessment 2020. FAO