Affiliation:
1. School of Mathematical Sciences University of Chinese Academy of Sciences Beijing China
2. Key Laboratory of Big Data Mining and Knowledge Management Chinese Academy of Sciences Beijing China
3. Department of Biostatistics Yale School of Public Health New Haven Connecticut USA
4. Pazhou Lab Guangzhou China
Abstract
AbstractGene expressions have been extensively studied in biomedical research. With gene expression, network analysis, which takes a system perspective and examines the interconnections among genes, has been established as highly important and meaningful. In the construction of gene expression networks, a commonly adopted technique is high‐dimensional regularized regression. Network construction can be unadjusted (which focuses on gene expressions only) and adjusted (which also incorporates regulators of gene expressions), and the two types of construction have different implications and can be equally important. In this article, we propose a variable selection hierarchy to connect the unadjusted regression‐based network construction with the adjusted construction that incorporates two or more types of regulators. This hierarchy is sensible and amounts to additional information for both constructions, thus having the potential of improving variable selection and estimation. An effective computational algorithm is developed, and extensive simulation demonstrates the superiority of the proposed construction over multiple closely relevant alternatives. The analysis of TCGA data further demonstrates the practical utility of the proposed approach.
Funder
Fundamental Research Funds for the Central Universities
National Natural Science Foundation of China
National Science Foundation of Sri Lanka
Subject
Computer Science Applications,Information Systems,Analysis