Abstract
SummaryIn cancer multi-omic studies, identifying the effects of somatic copy number aberrations (CNA) on physically distal gene expressions (trans-associations) can potentially uncover genes critical for cancer pathogenesis. Sparse canonical correlation analysis (SCCA) has emerged as a promising method for identifying associations in high-dimensional settings, owing to its ability to aggregate weaker associations and its improved interpretability. Traditional SCCA lacks hypothesis testing capabilities, which are critical for controlling false discoveries. This limitation has recently been addressed through a bias correction technique that enables calibrated hypothesis testing. In this article, we leverage the theoretical advancements in de-biased SCCA to present a computationally efficient pipeline for multi-omics analysis. This pipeline identifies and tests associations between multi-omics data modalities in biomedical settings, such as the trans-effects of CNA on gene expression. We propose a detailed algorithm to choose the tuning parameters of de-biased SCCA. Applying this pipeline to data on estrogen receptor (ER)-associated CNAs and 10,756 gene expressions from 1,904 breast cancer patients in the METABRIC study, we identified 456 CNAs trans-associated with 256 genes. Among these, 5 genes were identified only through de-biased SCCA and not by the standard pairwise regression approach. Downstream analysis with the 256 genes revealed that these genes were overrepresented in pathways relevant to breast cancer.
Publisher
Cold Spring Harbor Laboratory
Reference99 articles.
1. Robust biomarker identification for cancer diagnosis with ensemble feature selection methods
2. Comprehensive analysis of copy number aberrations in microsatellite stable colon cancer in view of stromal component
3. Anderson, T. W. (1962). An introduction to multivariate statistical analysis. Technical report, Wiley New York.
4. A simple measure of conditional dependence;The Annals of Statistics,2021
5. Baca-López, K. , Mayorga, M. , Hidalgo-Miranda, A. , Gutiérrez-Nájera, N. , and Hernández-Lemus, E. (2012). The role of master regulators in the metabolic/transcriptional coupling in breast carcinomas.