Canonical correlation analysis for multi-omics: Application to cross-cohort analysis

Author:

Jiang Min-ZhiORCID,Aguet François,Ardlie Kristin,Chen Jiawen,Cornell Elaine,Cruz Dan,Durda Peter,Gabriel Stacey B.,Gerszten Robert E.,Guo Xiuqing,Johnson Craig W.,Kasela Silva,Lange Leslie A.,Lappalainen Tuuli,Liu Yongmei,Reiner Alex P.,Smith Josh,Sofer Tamar,Taylor Kent D.,Tracy Russell P.,VanDenBerg David J.,Wilson James G.,Rich Stephen S.,Rotter Jerome I.ORCID,Love Michael I.,Raffield Laura M.ORCID,Li YunORCID,

Abstract

Integrative approaches that simultaneously model multi-omics data have gained increasing popularity because they provide holistic system biology views of multiple or all components in a biological system of interest. Canonical correlation analysis (CCA) is a correlation-based integrative method designed to extract latent features shared between multiple assays by finding the linear combinations of features–referred to as canonical variables (CVs)–within each assay that achieve maximal across-assay correlation. Although widely acknowledged as a powerful approach for multi-omics data, CCA has not been systematically applied to multi-omics data in large cohort studies, which has only recently become available. Here, we adapted sparse multiple CCA (SMCCA), a widely-used derivative of CCA, to proteomics and methylomics data from the Multi-Ethnic Study of Atherosclerosis (MESA) and Jackson Heart Study (JHS). To tackle challenges encountered when applying SMCCA to MESA and JHS, our adaptations include the incorporation of the Gram-Schmidt (GS) algorithm with SMCCA to improve orthogonality among CVs, and the development of Sparse Supervised Multiple CCA (SSMCCA) to allow supervised integration analysis for more than two assays. Effective application of SMCCA to the two real datasets reveals important findings. Applying our SMCCA-GS to MESA and JHS, we identified strong associations between blood cell counts and protein abundance, suggesting that adjustment of blood cell composition should be considered in protein-based association studies. Importantly, CVs obtained from two independent cohorts also demonstrate transferability across the cohorts. For example, proteomic CVs learned from JHS, when transferred to MESA, explain similar amounts of blood cell count phenotypic variance in MESA, explaining 39.0% ~ 50.0% variation in JHS and 38.9% ~ 49.1% in MESA. Similar transferability was observed for other omics-CV-trait pairs. This suggests that biologically meaningful and cohort-agnostic variation is captured by CVs. We anticipate that applying our SMCCA-GS and SSMCCA on various cohorts would help identify cohort-agnostic biologically meaningful relationships between multi-omics data and phenotypic traits.

Funder

National Heart, Lung, and Blood Institute

National Institute on Aging

National Center for Advancing Translational Sciences

Publisher

Public Library of Science (PLoS)

Subject

Cancer Research,Genetics (clinical),Genetics,Molecular Biology,Ecology, Evolution, Behavior and Systematics

Reference63 articles.

1. Extensions of sparse canonical correlation analysis with applications to genomic data;DM Witten;Stat Appl Genet Mol Biol,2009

2. JOINT AND INDIVIDUAL VARIATION EXPLAINED (JIVE) FOR INTEGRATED ANALYSIS OF MULTIPLE DATA TYPES.;EF Lock;Ann Appl Stat.,2013

3. Multi-Omics Factor Analysis—a framework for unsupervised integration of multi-omics data sets;R Argelaguet;Mol Syst Biol,2018

4. The GTEx Consortium atlas of genetic regulatory effects across human tissues;GTEx Consortium;Science,2020

5. Large-scale cis- and trans-eQTL analyses identify thousands of genetic loci and polygenic scores that regulate blood gene expression;U Võsa;Nat Genet,2021

Cited by 12 articles. 订阅此论文施引文献 订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3