Author:
Mourragui Soufiane M. C.,Loog Marco,van Nee Mirrelijn,de Wiel Mark A van,Reinders Marcel J. T.,Wessels Lodewyk F. A.
Abstract
AbstractMotivation: Anti-cancer drugs may elicit resistance or sensitivity through mechanisms which involve several genomic layers. Nevertheless, we have demonstrated that gene expression contains most of the predictive capacity compared to the remaining omic data types. Unfortunately, this comes at a price: gene expression biomarkers are often hard to interpret and show poor robustness.Results: To capture the best of both worlds, i.e. the accuracy of gene expression and the robustness of other genomic levels, such as mutations, copy-number or methylation, we developed Percolate, a computational approach which extracts the joint signal between gene expression and the other omic data types. We developed an out-of-sample extension of Percolate which allows predictions on unseen samples without the necessity to recompute the joint signal on all data. We employed Percolate to extract the joint signal between gene expression and either mutations, copy-number or methylation, and used the out-of sample extension to perform response prediction on unseen samples. We showed that the joint signal recapitulates, and sometimes exceeds, the predictive performance achieved with each data type individually. Importantly, molecular signatures created by Percolate do not require gene expression to be evaluated, rendering them suitable to clinical applications where only one data type is available.Availability: Percolate is available as a Python 3.7 package and the scripts to reproduce the results are available here.
Publisher
Springer Nature Switzerland
Reference42 articles.
1. Aben, N., et al.: TANDEM: a two-stage approach to maximize interpretability of drug response models based on multiple molecular data types. Bioinformatics 32(17), i413–i420 (2016)
2. Aben, N., et al.: ITOP: inferring the topology of omics data. Bioinformatics 34(17), i988–i996 (2018)
3. Akaike, H.: A new look at the statistical model identification. IEEE Trans. Autom. Control 19(6), 716–723 (1974)
4. Argelaguet, R., et al.: Multi-omics factor analysis-a framework for unsupervised integration of multi-omics data sets. Mol. Syst. Biol. 14(6), 1–13 (2018)
5. Bersanelli, M., et al.: Methods for the integration of multi-omics data: mathematical aspects. BMC Bioinform. 17(2), 167–177 (2016)
Cited by
2 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献