MOTL: enhancing multi-omics matrix factorization with transfer learning-Reference-Cited by-同舟云学术

MOTL: enhancing multi-omics matrix factorization with transfer learning

Published:2024-03-25 Issue: Volume: Page:
ISSN:
Container-title:
language:
Short-container-title:

Author:

Hirst David^ORCID,Térézol Morgane^ORCID,Cantini Laura^ORCID,Villoutreix Paul^ORCID,Vignes Matthieu^ORCID,Baudot Anaïs^ORCID

Abstract

AbstractJoint matrix factorization is a popular method for extracting lower dimensional representations of multi-omics data. It disentangles underlying mixtures of biological signals, facilitating efficient sample clustering, disease subtyping, or biomarker identification, for instance. However, when a multi-omics dataset is generated from only a limited number of samples, the effectiveness of matrix factorization is reduced. Addressing this limitation, we introduce MOTL (Multi-Omics Transfer Learning), a novel framework for multi-omics matrix factorization with transfer learning based on MOFA (Multi-Omics Factor Analysis). MOTL infers latent factors for a small multi-omics dataset, with respect to those inferred from a large heterogeneous learning dataset. We designed two protocols to evaluate transfer learning approaches, based on simulated and real multi-omics data. Using these protocols, we observed that MOTL improves the factorization of multi-omics datasets, comprised of a limited number of samples, when compared to factorization without transfer learning. We showcase the usefulness of MOTL on a glioblastoma dataset comprised of a small number of samples, revealing an enhanced delineation of cancer status and subtype thanks to transfer learning.

Publisher

Cold Spring Harbor Laboratory

Reference47 articles.

1. MOFA+: a statistical framework for comprehensive integration of multi-modal single-cell data;Genome Biology,2020

2. Multi‐Omics Factor Analysis—a framework for unsupervised integration of multi‐omics data sets

3. Banerjee, J. , Taroni, J. N. , Allaway, R. J. , Prasad, D. V. , Guinney, J. , and Greene, C. (2023). Machine learning in rare disease. Nature Methods, pages 1–12. Publisher: Nature Publishing Group.

4. Variational Inference: A Review for Statisticians;Journal of the American Statistical Association,2017