Affiliation:
1. Department of Biostatistics, University of Florida, Gainesville, FL 32603, USA
Abstract
With the growing use of high-throughput technologies, multi-omics data containing various types of high-dimensional omics data is increasingly being generated to explore the association between the molecular mechanism of the host and diseases. In this study, we present an adaptive sparse multi-block partial least square discriminant analysis (asmbPLS-DA), an extension of our previous work, asmbPLS. This integrative approach identifies the most relevant features across different types of omics data while discriminating multiple disease outcome groups. We used simulation data with various scenarios and a real dataset from the TCGA project to demonstrate that asmbPLS-DA can identify key biomarkers from each type of omics data with better biological relevance than existing competitive methods. Moreover, asmbPLS-DA showed comparable performance in the classification of subjects in terms of disease status or phenotypes using integrated multi-omics molecular profiles, especially when combined with other classification algorithms, such as linear discriminant analysis and random forest. We have made the R package called asmbPLS that implements this method publicly available on GitHub. Overall, asmbPLS-DA achieved competitive performance in terms of feature selection and classification. We believe that asmbPLS-DA can be a valuable tool for multi-omics research.
Subject
Genetics (clinical),Genetics
Reference58 articles.
1. Multi-omics data integration, interpretation, and its application;Subramanian;Bioinform. Biol. Insights,2020
2. Regularization paths for generalized linear models via coordinate descent;Friedman;J. Stat. Softw.,2010
3. Lê Cao, K.-A., Boitard, S., and Besse, P. (2011). Sparse PLS discriminant analysis: Biologically relevant feature selection and graphical displays for multiclass problems. BMC Bioinform., 12.
4. IPF-LASSO: Integrative-penalized regression with penalty factors for prediction based on multi-omics data;Boulesteix;Comput. Math. Methods Med.,2017
5. Regression shrinkage and selection via the lasso;Tibshirani;J. R. Stat. Soc. Ser. B (Methodol.),1996
Cited by
3 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献