Species-Agnostic Transfer Learning for Cross-species Transcriptomics Data Integration without Gene Orthology

Published: 2023-08-14

Author:

Park Youngjun, Muttray Nils Paul, Hauschild Anne-Christin

Abstract

Novel hypotheses in biomedical research are often developed or validated in model organisms such as mice and zebrafish and thus play a crucial role, particularly in studying disease mechanisms and treatment responses. However, due to biological differences between species, translating these findings into human applications remains challenging. Moreover, commonly used orthologous gene information is often incomplete, particularly for non-model organisms, and entails a significant information loss during gene-id conversion. To address these issues, we present a novel methodology for species-agnostic transfer learning with heterogeneous domain adaptation. We built on the cross-domain structure-preserving projection and extended the algorithm toward out-of-sample prediction, a common challenge in biomedical sequencing data. Our approach not only allows knowledge integration and translation across various species without relying on gene orthology but also identifies similar GO biological processes amongst the most influential genes composing the latent space for species integration. Subsequently, this enables the identification and functional annotation of genes missing from public orthology databases. Finally, we evaluated our approach with four different single-cell sequencing datasets focusing on out-of-sample prediction and compared it against related machine-learning approaches. In summary, the developed model outperforms all related methods working without prior knowledge when predicting unseen cell types based on other species’ data. The results demonstrate that our novel approach allows knowledge transfer beyond species barriers without the dependency on known gene orthology but utilizing the entire gene sets.
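
The idea of integrating species with unmatched gene sets in a shared latent space, and then projecting new cells into it, can be illustrated with a small sketch. This is not the authors' algorithm and not the cross-domain structure-preserving projection itself: it swaps in a much simpler stand-in, one ridge-regression projection per species onto one-hot cell-type targets. The data are synthetic, and all names (`simulate_species`, `fit_projection`, `W_src`, `W_tgt`) are hypothetical.

```python
import numpy as np

rng = np.random.default_rng(0)
N_CLASSES = 3  # shared cell types (synthetic assumption)

def simulate_species(n_genes, n_per_class):
    """Synthetic expression: one random mean profile per cell type plus unit noise."""
    means = 2.0 * rng.normal(size=(N_CLASSES, n_genes))
    y = np.repeat(np.arange(N_CLASSES), n_per_class)
    X = means[y] + rng.normal(size=(y.size, n_genes))
    return X, y

def fit_projection(X, y, reg=1.0):
    """Ridge-regress cells onto one-hot cell-type targets.

    Returns a projection matrix of shape (n_genes, N_CLASSES).
    """
    Y = np.eye(N_CLASSES)[y]
    gram = X.T @ X + reg * np.eye(X.shape[1])
    return np.linalg.solve(gram, X.T @ Y)

# Two "species" with different, unmatched gene sets (50 vs. 80 genes).
X_src, y_src = simulate_species(n_genes=50, n_per_class=100)
X_tgt, y_tgt = simulate_species(n_genes=80, n_per_class=55)

# Only 5 target cells per type are labeled; the rest are held out.
labeled = (np.arange(y_tgt.size) % 55) < 5

# One projection per species, both mapping into the same 3-dimensional space.
W_src = fit_projection(X_src, y_src)
W_tgt = fit_projection(X_tgt[labeled], y_tgt[labeled])

# Pool both species in the shared space and compute per-type centroids.
Z_train = np.vstack([X_src @ W_src, X_tgt[labeled] @ W_tgt])
y_train = np.concatenate([y_src, y_tgt[labeled]])
centroids = np.stack(
    [Z_train[y_train == c].mean(axis=0) for c in range(N_CLASSES)]
)

# Out-of-sample step: project held-out target cells with the fixed W_tgt
# and assign each one to the nearest centroid.
Z_new = X_tgt[~labeled] @ W_tgt
dists = ((Z_new[:, None, :] - centroids[None, :, :]) ** 2).sum(axis=2)
accuracy = float((dists.argmin(axis=1) == y_tgt[~labeled]).mean())
print(f"held-out target accuracy: {accuracy:.2f}")
```

No gene-to-gene mapping between the two feature spaces is used; only the cell-type labels tie them together. The rows of `W_src` and `W_tgt` with the largest magnitude would be the genes that most influence each latent dimension, a rough analogue of the abstract's "most influential genes".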

Publisher

Cold Spring Harbor Laboratory

References (58 articles)

1. The age of model organisms

2. Multi-omics integration in the age of million single-cell data; Nature Reviews Nephrology, 2021

3. Shafer, M.E.: Cross-species analysis of single-cell transcriptomic data. Frontiers in Cell and Developmental Biology 7, 175 (2019)

4. Transfer learning efficiently maps bone marrow cell types from mouse to human using single-cell RNA sequencing; Communications Biology, 2020

5. scAdapt: virtual adversarial domain adaptation network for single cell RNA-seq data classification across platforms and species; Briefings in Bioinformatics, 2021

Cited by 1 article.

1. AutoTransOP: translating omics signatures without orthologue requirements using deep learning; npj Systems Biology and Applications; 2024-01-29