scHiMe: Predicting single-cell DNA methylation levels based on single-cell Hi-C data

Author:

Zhu Hao,Liu Tong,Wang Zheng

Abstract

AbstractRecently a biochemistry experiment was developed to simultaneously capture the chromosomal conformations and DNA methylation levels on single cells. A computational tool to predict single-cell methylation levels based on single-cell Hi-C data becomes necessary due to the availability of this experiment. scHiMe was developed to predict the base-pair-specific methylation levels in the promoter regions genome-wide based on the single-cell Hi-C data and DNA nucleotide sequences using the graph transformer algorithm. Promoter-promoter spatial interaction networks were built based on single-cell Hi-C data, and single-cell DNA methylation levels on 1000 base pairs for each promoter were predicted based on the network topology and DNA sequence. Our evaluation results showed a high consistency between the predicted and the true methylation values. We tested using predicted DNA methylation levels on all promoters to classify cells into different cell types, and our results showed that the predicted DNA methylation levels resulted in almost perfect cell-type classification, which indicated that our predictions maintained the cell-to-cell variability. We also tested using the predicted DNA methylation levels of different subsets of promoters and different subsets of CpGs in promoters to classify cells and provided the promoters and CpGs that were most influential in cell-type clustering. Moreover, we observed slightly better performance for the nodes that have higher degree values in the promoter-promoter spatial interaction network but did not find a similar trend for more significant network influencers. Last but not least, we found that using the predicted methylation levels of only housekeeping genes led to less accurate cell-type clustering, which demonstrated that our methylation predictions fit the biological meanings of housekeeping genes since housekeeping genes usually have constant and similar genetic and epigenetic features among different types of cells. scHiMe is freely available at http://dna.cs.miami.edu/scHiMe/.Author SummaryDNA methylation is the process of adding methyl groups to the DNA molecule without changing the nucleotide sequence, which can significantly change the activities of the DNA. Although DNA is a one-dimensional long sequence consisting of cytosine (C), guanine (G), adenine (A), and thymine (T), it folds into a three-dimensional structure in the nucleus of the cell. Scientists believe that this 3D structure has relationships with the activities of the DNA. This research uses deep learning to predict the DNA methylation status of each cytosine-guanine pair in the promoter regions of all human genes based on the 3D structure of the DNA. This not only adds a useful method to the computational biology field but also proves that 3D genome structure does have a relationship with DNA methylation. Moreover, the 3D genome and the DNA methylation are not based on a population of cells, but on individual cells, which provides a cell-specific perspective that allows the understanding of cell-to-cell variability.

Publisher

Cold Spring Harbor Laboratory

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3