cFIT: Integration and transfer learning of single cell transcriptomes, illustrated by fetal brain cell development-Reference-Cited by-同舟云学术

cFIT: Integration and transfer learning of single cell transcriptomes, illustrated by fetal brain cell development

Published:2020-08-31 Issue: Volume: Page:
ISSN:
Container-title:
language:
Short-container-title:

Author:

Peng Minshi,Li Yue,Wamsley Brie,Wei Yuting,Roeder Kathryn

Abstract

AbstractLarge, comprehensive collections of scRNA-seq data sets have been generated that allow for the full transcriptional characterization of cell types across a wide variety of biological and clinical conditions. As new methods arise to measure distinct cellular modalities, a key analytical challenge is to integrate these data sets or transfer knowledge from one to the other to better understand cellular identity and functions. Here, we present a simple yet surprisingly effective method named cFIT for capturing various batch effects across experiments, technologies, subjects, and even species. The proposed method models the shared information between various data sets by a common factor space, while allowing for unique distortions and shifts in gene-wise expression in each batch. The model parameters are learned under an iterative non-negative matrix factorization (NMF) framework and then used for synchronized integration from across-domain assays. In addition, the model enables transferring via low-rank matrix from more informative data to allow for precise identification in data of lower quality. Compared to existing approaches, our method imposes weaker assumptions on the cell composition of each individual data set, however, is shown to be more reliable in preserving biological variations. We apply cFIT to multiple scRNA-seq data sets of developing brain from human and mouse, varying by technologies and developmental stages. The successful integration and transfer uncover the transcriptional resemblance across systems. The study helps establish a comprehensive landscape of brain cell type diversity and provides insights into brain development.

Publisher

Cold Spring Harbor Laboratory

Reference40 articles.

1. On the Goldstein-Levitin-Polyak gradient projection method

2. Integrating single-cell transcriptomic data across different conditions, technologies, and species

3. Human cerebral organoids recapitulate gene expression programs of fetal neocortex development

4. A survey of human brain transcriptome diversity at the single cell level

5. cellHarmony: cell-level matching and holistic comparison of single-cell transcriptomes