Abstract
ABSTRACTCurcuma longa, or turmeric, is traditionally known for its immense medicinal properties and has diverse therapeutic applications. However, the absence of a reference genome sequence is a limiting factor in understanding the genomic basis of the origin of its medicinal properties. In this study, we present the draft genome sequence of Curcuma longa, the first species sequenced from Zingiberaceae plant family, constructed using 10x Genomics linked reads. For comprehensive gene set prediction and for insights into its gene expression, the transcriptome sequencing of leaf tissue was also performed. The draft genome assembly had a size of 1.24 Gbp with ~74% repetitive sequences, and contained 56,036 coding gene sequences. The phylogenetic position of Curcuma longa was resolved through a comprehensive genome-wide phylogenetic analysis with 16 other plant species. Using 5,294 orthogroups, the comparative evolutionary analysis performed across 17 species including Curcuma longa revealed evolution in genes associated with secondary metabolism, plant phytohormones signaling, and various biotic and abiotic stress tolerance responses. These mechanisms are crucial for perennial and rhizomatous plants such as Curcuma longa for defense and environmental stress tolerance via production of secondary metabolites, which are associated with the wide range of medicinal properties in Curcuma longa.
Publisher
Cold Spring Harbor Laboratory