Abstract
AbstractMajor depressive disorder (MDD) is a leading cause of disability worldwide, and is commonly treated with antidepressant drugs (AD). Although effective, many patients fail to respond to AD treatment, and accordingly identifying factors that can predict AD response would greatly improve treatment outcomes. In this study, we developed a machine learning tool to integrate multi-omic datasets (gene expression, DNA methylation, and genotyping) to identify biomarker profiles associated with AD response in a cohort of individuals with MDD. To address this rich multi-omic dataset with high dimensional features, we developed integrative Geneset-Embedded non-negative Matrix factorization (iGEM), a non-negative matrix factorization (NMF) based model, supplemented with auxiliary information regarding genesets and gene-methylation relationships. Using our model, we identified a number of meta-phenotypes which were related to AD response. By integrating geneset information into the model, we were able to relate these meta-phenotypes to biological processes, including immune and inflammatory functions. This represents both biomarkers to predict response, as well as potential new treatment targets. Our method is applicable to other diseases with multi-omic data, and the software is open source and available on Github (https://github.com/li-lab-mcgill/iGEM).
Publisher
Cold Spring Harbor Laboratory