Abstract
AbstractDecreasing sequencing costs have instigated large-scale RNAseq experiments, yet genetic polymorphisms in such data remain poorly exploited. Currently, allele-specific expression (ASE) studies focus almost exclusively on genetic variants explaining expression differences (cis-eQTLs), largely ignoring other ASE effects. The latter are typically associated with higher variance in expression of both copies of a gene, here called Allelic Divergence (AD). We therefore developed an RNAseq-driven population-level beta-binomial mixture model for (differential) AD detection. The model simultaneously enables RNAseq-driven genotyping, which outperforms alternative RNA genotyping methods when applied on healthy kidney data from The Cancer Genome Atlas. Moreover, we identify well-known non-cis-eQTL ASE, e.g. random monoallelic expression of HLA and immunoglobulin genes in healthy kidney, as well as allele-specific aberrations in clear cell kidney carcinoma, including long-range epigenetic silencing of protocadherins, copy-number alterations, and loss of imprinting. These methods are available as the Modeller of Allelic Gene Expression (MAGE) tool suite: https://biobix.github.io/MAGE/.
Publisher
Cold Spring Harbor Laboratory