Author:
Chandra Omkar,Pramanik Durjay,Gautam Srishti,Sharma Madhu,Dubey Niharika,Mahato Biswarup,Kumar Vibhor
Abstract
AbstractUnderstanding genome-wide epigenetic regulation of diseases is important in establishing pathogenic factors and could aid in disease diagnosis, prognosis, and therapeutics. In this study, we have utilized transcription factors (TFs) and co-factor profiles (n=823) as features to predict their link with various diseases. Further, along with TFs and co-factor profiles, histone modifications ChIP-seq (n = 621), cap analysis gene expression (CAGE) tags (n = 255), and DNase hypersensitivity profiles (n = 255) as features allowed for the modeling of association of coding and non-coding genes to diseases. Such predicted associations could be independently validated using genome-wide association data and survival analysis. However, the unique aspect of our approach is that it highlights the link between TF binding patterns and diseases in the context of cell types. Besides highlighting relevant TF-binding in known cell-types associated with diseases, it also provided their surprising link with TFs expressed in immune cells and other seemingly non-related cells. Further investigation revealed such links to be genuine and potentially useful for prognosis, further revealing the need to deconvolve a set of known genes associated with diseases.
Publisher
Cold Spring Harbor Laboratory