Incorporating graph information in Bayesian factor analysis with robust and adaptive shrinkage priors

Author:

Zhang Qiyiwen1ORCID,Chang Changgee2ORCID,Shen Li1ORCID,Long Qi1ORCID

Affiliation:

1. Department of Biostatistics, Epidemiology, and Informatics, Perelman School of Medicine, University of Pennsylvania , Philadelphia, PA 19104 , United States

2. Department of Biostatistics and Health Data Science, Indiana University School of Medicine , Indianapolis, IN 47405 , United States

Abstract

ABSTRACT There has been an increasing interest in decomposing high-dimensional multi-omics data into a product of low-rank and sparse matrices for the purpose of dimension reduction and feature engineering. Bayesian factor models achieve such low-dimensional representation of the original data through different sparsity-inducing priors. However, few of these models can efficiently incorporate the information encoded by the biological graphs, which has been already proven to be useful in many analysis tasks. In this work, we propose a Bayesian factor model with novel hierarchical priors, which incorporate the biological graph knowledge as a tool of identifying a group of genes functioning collaboratively. The proposed model therefore enables sparsity within networks by allowing each factor loading to be shrunk adaptively and by considering additional layers to relate individual shrinkage parameters to the underlying graph information, both of which yield a more accurate structure recovery of factor loadings. Further, this new priors overcome the phase transition phenomenon, in contrast to existing graph-incorporated approaches, so that it is robust to noisy edges that are inconsistent with the actual sparsity structure of the factor loadings. Finally, our model can handle both continuous and discrete data types. The proposed method is shown to outperform several existing factor analysis methods through simulation experiments and real data analyses.

Funder

National Institutes of Health

Publisher

Oxford University Press (OUP)

Subject

Applied Mathematics,General Agricultural and Biological Sciences,General Immunology and Microbiology,General Biochemistry, Genetics and Molecular Biology,General Medicine,Statistics and Probability

Reference45 articles.

1. Scale mixtures of normal distributions;Andrews;Journal of the Royal Statistical Society: Series B (Methodological),1974

2. Multi-omics factor analysis-a framework for unsupervised integration of multi-omics data sets;Argelaguet,2018

3. Sparse Bayesian infinite factor models;Bhattacharya;Biometrika,2011

4. Normalization approaches for removing systematic biases associated with mass spectrometry and label-free proteomics;Callister;Journal of Proteome Research,2006

5. The horseshoe estimator for sparse signals;Carvalho;Biometrika,2010

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3