High-quality genome assembly enables prediction of allele-specific gene expression in hybrid poplar

Authors:

Shi Tian-Le (1); Jia Kai-Hua (1, 2); Bao Yu-Tao (1); Nie Shuai (3); Tian Xue-Chan (1); Yan Xue-Mei (1); Chen Zhao-Yang (1); Li Zhi-Chao (1); Zhao Shi-Wei (1); Ma Hai-Yao (1); Zhao Ye (1); Li Xiang (4); Zhang Ren-Gang (5); Guo Jing (6); Zhao Wei (7); El-Kassaby Yousry Aly (8); Müller Niels (9); Van de Peer Yves (10, 11, 12, 13); Wang Xiao-Ru (7); Street Nathaniel Robert (14); Porth Ilga (15); An Xinmin (1); Mao Jian-Feng (1, 14)

Affiliations:

1. State Key Laboratory of Tree Genetics and Breeding, National Engineering Research Center of Tree Breeding and Ecological Restoration, Beijing Advanced Innovation Center for Tree Breeding by Molecular Design, National Engineering Laboratory for Tree Breeding, Key Laboratory of Genetics and Breeding in Forest Trees and Ornamental Plants, Ministry of Education, College of Biological Sciences and Technology, Beijing Forestry University, Beijing 100083, China

2. Key Laboratory of Crop Genetic Improvement & Ecology and Physiology, Institute of Crop Germplasm Resources, Shandong Academy of Agricultural Sciences, Ji’nan 250100, China

3. Rice Research Institute, Guangdong Academy of Agricultural Sciences & Key Laboratory of Genetics and Breeding of High Quality Rice in Southern China (Co-construction by Ministry and Province), Ministry of Agriculture and Rural Affairs & Guangdong Key Laboratory of New Technology in Rice Breeding, Guangzhou 510640, China

4. School of Agriculture, Ningxia University, Yinchuan 750021, China

5. Yunnan Key Laboratory for Integrative Conservation of Plant Species with Extremely Small Populations, Key Laboratory for Plant Diversity and Biogeography of East Asia, Kunming Institute of Botany, Chinese Academy of Sciences, Kunming 650201, Yunnan, China

6. College of Forestry, Shandong Agricultural University, Tai’an 271000, China

7. Umeå Plant Science Centre, Department of Ecology and Environmental Science, Umeå University, SE-901 87 Umeå, Sweden

8. Department of Forest and Conservation Sciences, Faculty of Forestry, University of British Columbia, Vancouver, BC V6T 1Z4, Canada

9. Thünen-Institute of Forest Genetics, 22927 Grosshansdorf, Germany

10. Department of Plant Biotechnology and Bioinformatics, Ghent University, 9052 Ghent, Belgium

11. VIB Center for Plant Systems Biology, 9052 Ghent, Belgium

12. Centre for Microbial Ecology and Genomics, Department of Biochemistry, Genetics and Microbiology, University of Pretoria, Pretoria 0028, South Africa

13. College of Horticulture, Academy for Advanced Interdisciplinary Studies, Nanjing Agricultural University, Nanjing 210095, China

14. Umeå Plant Science Centre, Department of Plant Physiology, Umeå University, SE-901 87 Umeå, Sweden

15. Département des Sciences du Bois et de la Forêt, Faculté de Foresterie, de Géographie et Géomatique, Université Laval, Québec, QC G1V 0A6, Canada

Abstract

Poplar (Populus) is a well-established model system for tree genomics and molecular breeding, and hybrid poplar is widely used in forest plantations. However, distinguishing its diploid homologous chromosomes is difficult, which complicates advanced functional studies of specific alleles. In this study, we applied a trio-binning design and PacBio high-fidelity long-read sequencing to obtain haplotype-phased telomere-to-telomere genome assemblies for the 2 parents of the well-studied F1 hybrid “84K” (Populus alba × Populus tremula var. glandulosa). Almost all chromosomes, including the telomeres and centromeres, were completely assembled for each haplotype subgenome, apart from 2 small gaps on one chromosome. By combining these haplotype assemblies with extensive RNA-seq data, we analyzed gene expression patterns between the 2 subgenomes and between alleles. We detected no transcription bias at the subgenome level, but found extensive expression differences between alleles. We developed machine-learning (ML) models that predict allele-specific expression (ASE) with high accuracy and identified the underlying genomic features that most strongly influence ASE. One of our models, with 15 predictor variables, achieved 77% accuracy on the training set and 74% accuracy on the testing set. The ML models identified gene body CHG methylation, sequence divergence, and transposon occupancy both upstream and downstream of alleles as important factors for ASE. Our haplotype-phased genome assemblies and ML strategy highlight an avenue for functional studies in Populus and provide additional tools for studying ASE and heterosis in hybrids.

Funder

National Key R&D Program of China

National Natural Science Foundation of China

Publisher

Oxford University Press (OUP)
