Abstract
AbstractGene expression signatures (GES) connect phenotypes to mRNA expression patterns, providing a powerful approach to define cellular identity, function, and the effects of perturbations. However, the use of GES has suffered from vague assessment criteria and limited reproducibility. The structure of proteins defines the functional capability of genes, and hence, we hypothesized that enrichment of structural features could be a generalizable representation of gene sets. We derive structural gene expression signatures (sGES) using features from various levels of protein structure (e.g. domain, fold) encoded by the transcribed genes in GES, to describe cellular phenotypes. Comprehensive analyses of data from the Genotype-Tissue Expression Project (GTEx), ARCHS4, and mRNA expression of drug effects on cardiomyocytes show that structural GES (sGES) are useful for identifying robust signatures of biological phenomena. sGES also enables the characterization of signatures across experimental platforms, facilitates the interoperability of expression datasets, and can describe drug action on cells.
Publisher
Cold Spring Harbor Laboratory
Reference33 articles.
1. GEN3VA: aggregation and analysis of gene expression signatures from related studies;BMC Bioinformatics.,2016
2. GSMA: an approach to identify robust global and test Gene Signatures using Meta-Analysis
3. HiFreSP: A novel high-frequency sub-pathway mining approach to identify robust prognostic gene signatures | Briefings in Bioinformatics | Oxford Academic, (available at https://academic.oup.com/bib/advance-article/doi/10.1093/bib/bbz078/5536887).
4. The Gene Ontology Resource: 20 years and still GOing strong
5. Gene Ontology: tool for the unification of biology