Affiliation:
1. Division of Biostatistics and Health Data Science School of Public Health University of Minnesota Minnesota USA
2. Department of Mathematics and Statistics University of Minnesota Duluth Minnesota USA
Abstract
AbstractWe introduce a Bayesian approach for biclustering that accounts for the prior functional dependence between genes using hidden Markov models (HMMs). We utilize biological knowledge gathered from gene ontologies and the hidden Markov structure to capture the potential coexpression of neighboring genes. Our interpretable model‐based clustering characterized each cluster of samples by three groups of features: overexpressed, underexpressed, and irrelevant features. The proposed methods have been implemented in an R package and are used to analyze both the simulated data and The Cancer Genome Atlas kidney cancer data.
Funder
National Institutes of Health
University of Minnesota
Natural Sciences and Engineering Research Council of Canada