TRAmHap: accurate prediction of transcriptional activity from DNA methylation haplotypes in bisulfite-sequencing data

Author:

Gao Siqi1,Zhu Hanwen2,Cai Kangwen2,Liu Leiqin1,Zhang Zhiqiang1,Ding Yi1,Xu Yaochen1,Zheng Xiaoqi34,Shi Jiantao1ORCID

Affiliation:

1. State Key Laboratory of Molecular Biology, Shanghai Institute of Biochemistry and Cell Biology, Center for Excellence in Molecular Cell Science, Chinese Academy of Sciences , Shanghai 200031 , China

2. Department of Mathematics, Shanghai Normal University , Shanghai, 200234 , China

3. Center for Single-Cell Omics , School of Public Health, , Shanghai 200025 , China

4. Shanghai Jiao Tong University School of Medicine , School of Public Health, , Shanghai 200025 , China

Abstract

Abstract Deoxyribonucleic acid (DNA) methylation (DNAm) is an important epigenetic mechanism that plays a role in chromatin structure and transcriptional regulation. Elucidating the relationship between DNAm and gene expression is of great importance for understanding its role in transcriptional regulation. The conventional approach is to construct machine-learning-based methods to predict gene expression based on mean methylation signals in promoter regions. However, this type of strategy only explains about 25% of gene expression variation, and hence is inadequate in elucidating the relationship between DNAm and transcriptional activity. In addition, using mean methylation as input features neglects the heterogeneity of cell populations that can be reflected by DNAm haplotypes. We here developed TRAmaHap, a novel deep-learning framework that predicts gene expression by utilizing the characteristics of DNAm haplotypes in proximal promoters and distal enhancers. Using benchmark data of human and mouse normal tissues, TRAmHap shows much higher accuracy than existing machine-learning based methods, by explaining 60~80% of gene expression variation across tissue types and disease conditions. Our model demonstrated that gene expression can be accurately predicted by DNAm patterns in promoters and long-range enhancers as far as 25 kb away from transcription start site, especially in the presence of intra-gene chromatin interactions.

Funder

Hundred Talents Program Award

Chinese Academy of Sciences

National Natural Science Foundation of China

Publisher

Oxford University Press (OUP)

Subject

Molecular Biology,Information Systems

Reference36 articles.

1. The DNA methylation landscape in cancer;Skvortsova;Essays Biochem,2019

2. The role of DNA methylation in epigenetics of aging;Unnikrishnan;Pharmacol Ther,2019

3. DNA methylation patterns and epigenetic memory;Bird;Genes Dev,2002

4. DNA methylation: roles in mammalian development;Smith;Nat Rev Genet,2013

5. DNA methylation in mammals;Li;Cold Spring Harb Perspect Biol,2014

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3