Abstract
Epigenetic mechanisms coordinate the packaging, accessibility and read-out of the DNA sequence within the chromatin context, and contribute significantly to the regulation of gene expression. They therefore play fundamental roles in differentiation on the one hand and in the maintenance and propagation of cell identity on the other. Epigenetic malfunction is associated with a wide range of diseases, from neurodevelopmental disorders to cancer progression. In humans, hundreds of known epigenetic factors and complexes are involved in establishing covalent modifications on the DNA sequence itself and on the associated histone proteins. Within the cellular context, the resulting combinatorial epigenomic patterns are neither established nor interpreted independently of each other, and therefore exhibit high correlations in a region-specific manner. Post-translational modifications of histone proteins can be analysed using Chromatin Immunoprecipitation followed by sequencing (ChIP-Seq). Often, assays for several different histone modifications are performed as part of the same experimental design. These measurements are, however, confounded by shared biases, including chromatin accessibility and mappability. Existing computational methods analyse each histone modification separately. We introduce DecoDen, a new approach that leverages replicates and multi-histone ChIP-Seq experiments for a fixed cell type to learn and remove shared biases. DecoDen (Deconvolve and Denoise) consists of two major steps: first, non-negative matrix factorisation (NMF) is used to learn a joint cell-type-specific background signal; second, half-sibling regression (HSR) is used to correct for these biases in the histone modification signals. We demonstrate that DecoDen is a robust and interpretable method that enables the unbiased discovery of subtle peaks, which are particularly important in an individual-specific context.
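The two steps named above can be illustrated with a minimal NumPy sketch on synthetic data. This is not the DecoDen implementation: the toy mixing matrix, the helper names `nmf` and `half_sibling_regression`, the multiplicative-update NMF solver, the rule used to pick the background component, and the ordinary least-squares form of HSR are all assumptions made for illustration only.

```python
import numpy as np

def nmf(X, k, n_iter=300, eps=1e-9, seed=0):
    """Multiplicative-update NMF: find W, H >= 0 with X ~= W @ H (Frobenius loss)."""
    rng = np.random.default_rng(seed)
    W = rng.random((X.shape[0], k)) + eps
    H = rng.random((k, X.shape[1])) + eps
    for _ in range(n_iter):
        H *= (W.T @ X) / (W.T @ W @ H + eps)
        W *= (X @ H.T) / (W @ (H @ H.T) + eps)
    return W, H

def half_sibling_regression(target, predictor):
    """Remove from `target` the part linearly predictable from `predictor`."""
    A = np.column_stack([predictor, np.ones_like(predictor)])
    coef, *_ = np.linalg.lstsq(A, target, rcond=None)
    return target - A @ coef  # least-squares residual

# Synthetic example (all values hypothetical): 2000 genomic bins, one shared
# background and two sparse histone-mark signals.
rng = np.random.default_rng(1)
n_bins = 2000
background = rng.gamma(2.0, 1.0, n_bins)
mark_a = 10.0 * (rng.random(n_bins) < 0.05)
mark_b = 10.0 * (rng.random(n_bins) < 0.05)
sources = np.column_stack([background, mark_a, mark_b])

# Rows: assays (two control replicates, two replicates per mark).
# Columns: contribution of background, mark A, mark B.
mixing = np.array([
    [1.0, 0.0, 0.0],
    [1.2, 0.0, 0.0],
    [0.8, 1.0, 0.0],
    [0.9, 1.1, 0.0],
    [0.7, 0.0, 1.0],
    [0.6, 0.0, 0.9],
])
X = sources @ mixing.T + 0.1 * rng.normal(size=(n_bins, 6))
X = np.clip(X, 0.0, None)  # NMF requires non-negative input
control_cols = [0, 1]

# Step 1: learn shared components across all assays.
W, H = nmf(X, k=3)

# Assumed heuristic: the background component is the one whose loadings
# are most concentrated on the control assays.
control_share = H[:, control_cols].sum(axis=1) / (H.sum(axis=1) + 1e-12)
bg_component = int(np.argmax(control_share))
bg_estimate = W[:, bg_component]

# Step 2: regress one histone track on the background estimate.
raw = X[:, 2]
cleaned = half_sibling_regression(raw, bg_estimate)
```

Because the regression includes an intercept, `cleaned` is centred around zero and can take negative values; any downstream peak calling would need to account for that. NMF solutions are also not unique, so the component chosen as background can depend on the initialisation.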
Publisher
Cold Spring Harbor Laboratory