Balanced Functional Module Detection in genomic data

Author:

Tritchler David12,Towle-Miller Lorin M1ORCID,Miecznikowski Jeffrey C1

Affiliation:

1. Department of Biostatistics, University at Buffalo, Buffalo, NY 14260, USA

2. Biostatistics Division, University of Toronto, Toronto, ON M5S 1A1, Canada

Abstract

Abstract Motivation High-dimensional genomic data can be analyzed to understand the effects of variables on a target variable such as a clinical outcome. For understanding the underlying biological mechanism affecting the target, it is important to discover the complete set of relevant variables. Thus variable selection is a primary goal, which differs from a prediction criterion. Of special interest are functional modules, cooperating sets of variables affecting the target which can be characterized by a graph. In applications such as social networks, the concept of balance in undirected signed graphs characterizes the consistency of associations within the network. This property requires that the module variables have a joint effect on the target outcome with no internal conflict, an efficiency that may be applied to biological networks. Results In this paper, we model genomic variables in signed undirected graphs for applications where the set of predictor variables influences an outcome. Consequences of the balance property are exploited to implement a new module discovery algorithm, balanced Functional Module Detection (bFMD), which selects a subset of variables from high-dimensional data that compose a balanced functional module. Our bFMD algorithm performed favorably in simulations as compared to other module detection methods. Additionally, bFMD detected interpretable results in an application using RNA-seq data obtained from subjects with Uterine Corpus Endometrial Carcinoma using the percentage of tumor invasion as the outcome of interest. The variables selected by bFMD have improved interpretability due to the logical consistency afforded by the balance property. Supplementary information Supplementary data are available at Bioinformatics Advances online.

Funder

New York State Department of Health [Empire Clinical Research Investigator Program to L.M.T.-M.]

Publisher

Oxford University Press (OUP)

Reference38 articles.

1. Connecting the dots: econometric methods for uncovering networks with an application to the Australian Financial Institutions;Anufriev;J. Banking Finance,2015

2. Semi-supervised methods to predict patient survival from gene expression data;Bair;PLoS Biol,2004

3. Prediction by supervised principal components;Bair;J. Am. Stat. Assoc,2006

4. Multilayer Networks

5. Surgical staging in endometrial cancer: clinical-pathologic findings of a prospective study;Boronow;Obstet. Gynecol,1984

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3