Affiliation:
1. Hunan Research Center of Engineering Technology for Utilization of Environmental and Resources Plant, Central South University of Forestry and Technology, Changsha, Hunan, China
2. College of Forestry, Northwest A & F University, Yangling, China
3. Hunan Urban and Rural Ecological Planning and Restoration Engineering Research Center, Hunan City University, Yiyang, Hunan, China
Abstract
Camellia oleiferais one of the four largest woody edible oil plants in the world with high ecological and medicinal values. Due to frequent interspecific hybridization, it was difficult to study its genetics and evolutionary history. This study usedC. oleiferathat was collected on Hainan Island to conduct our research. The unique island environment makes the quality of tea oil higher than that of other species grown in the mainland. Moreover, a long-term geographic isolation might affect gene structure. In order to better understand the molecular biology of this species, protect excellent germplasm resources, and promote the population genetics and phylogenetic studies ofCamelliaplants, high-throughput sequencing technology was used to obtain the chloroplast genome sequence of HainanC. oleifera. The results showed that the whole chloroplast genome ofC. oleiferain Hainan was 156,995 bp in length, with a typical quadripartite structure of a large single copy (LSC) region of 86,648 bp, a small single copy (SSC) region of 18,297 bp, and a pair of inverted repeats (IRs) of 26,025 bp. The whole genome encoded a total of 141 genes (115 different genes), including 88 protein-coding genes, 45 tRNA genes, and eight rRNA genes. Among these genes, nine genes contained one intron, two genes contained two introns, and four overlapping genes were also detected. The total GC content of HainanC. oleifera’s chloroplast genome was 37.29%. The chloroplast genome structure characteristics of HainanC. oleiferawere compared with mainlandC. oleiferaand those of the other eight closely related Theaceae species; it was found that the contractions and expansions of the IR/LSC and IR/SSC regions affected the length of chloroplast genome. The chloroplast genome sequences of these Theaceae species were highly similar. A comparative analysis indicated that the Theaceae species were conserved in structure and evolution. A total of 51 simple sequence repeat (SSR) loci were detected in the chloroplast genome of HainanC. oleifera, and allCamelliaplants did not have pentanucleotide repeats, which could be used as a good marker in phylogenetic studies. We also detected seven long repeats, the base composition of all repeats was biased toward A/T, which was consistent with the codon bias. It was found that HainanC. oleiferahad a similar evolutionary relationship withC. crapnelliana, through the use of codons and phylogenetic analysis. This study can provide an effective genomic resource for the evolutionary history of Theaceae family.
Funder
Science and technology major project of Hunan province
Key Technology R&D Program of Hunan Province
National Key Technology Research and Development Program of China
Subject
General Agricultural and Biological Sciences,General Biochemistry, Genetics and Molecular Biology,General Medicine,General Neuroscience
Reference69 articles.
1. Scaffolding pre-assembled contigs using SSPACE;Boetzer;Bioinformatics,2011
2. Are codon usage patterns in unicellular organisms determined by selection-mutation balance?;Bulmer;Journal of Evolutionary Biology,1988
3. The chloroplast genome of Passiflora edulis (Passifloraceae) assembled from long sequence reads: structural organization and phylogenomic studies in Malpighiales;Cauzsantos;Frontiers in Plant Science,2017
4. A DNA barcode for land plants;CBOL Plant Working Group;Proceedings of the National Academy of Sciences of the United States of America,2009
5. Mining and characterizing microsatellites from citrus ESTs;Chen;Theoretical and Applied Genetics,2006