Abstract
AbstractAs more multi-omics data such as metabolome, proteome and transcriptome data are acquired, statistical analyses to integrate multi-omics data has been required. Multivariate analysis for integration of multi-omics data has been proposed such as multiset partial least squares (PLS). However, when we want to extract scores about the rank order of groups such as severity of disease as well as difference of groups, we could not always extract scores with rank order of groups by using multiset PLS. Therefore, we proposed multiset PLS with rank order of groups (multiset PLS-ROG) to integrate multiple omics dataset. We visualized multiple omics data by using multiset PLS-ROG in two studies of multi-omics and multi-organ derived metabolome data. After we visualize data and focus on a specific score with phenotype of interest, it is important to select compounds to identify biomarker candidates or make biological inferences. In ordinary principal component analysis (PCA) or PLS, we could select statistical significantly compounds correlated with PC or PLS score by using PC or PLS loadings. However, multiset PLS loading has not been used because its statistical property has not been clarified. Then, we clarified statistical property of multiset PLS and multiset PLS-ROG loading, and we could identify statistically significant compounds by using statistical hypothesis testing. We implemented multiset PLS-ROG to R loadings package, freely available in CRAN.
Publisher
Cold Spring Harbor Laboratory
Cited by
1 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献