Sample demultiplexing, multiplet detection, experiment planning and novel cell type verification in single cell sequencing

Published: 2019-11-04

Author:

Xin Hongyi, Yan Qi, Jiang Yale, Lian Qiuyu, Luo Jiadi, Erb Carla, Duerr Richard, Chen Kong, Chen Wei

Abstract

Identifying and removing multiplets from downstream analysis is essential to improve the scalability and reliability of single cell RNA sequencing (scRNA-seq). High multiplet rates create artificial cell types in the dataset. Sample barcoding, including the cell hashing technology and the MULTI-seq technology, enables analytical identification of a fraction of multiplets in a scRNA-seq dataset. We propose a Gaussian-mixture-model-based multiplet identification method, GMM-Demux. GMM-Demux accurately identifies and removes the sample-barcoding-detectable multiplets and estimates the percentage of sample-barcoding-undetectable multiplets in the remaining dataset. GMM-Demux describes the droplet formation process with an augmented binomial probabilistic model, and uses the model to authenticate cell types discovered from a scRNA-seq dataset. We conducted two cell-hashing experiments, collected a public cell-hashing dataset, and generated a simulated cell-hashing dataset. We compared the classification result of GMM-Demux against a state-of-the-art heuristic-based classifier. We show that GMM-Demux is more accurate, more stable, reduces the error rate by up to 69×, and is capable of reliably recognizing 9 multiplet-induced fake cell types and 8 real cell types in a PBMC scRNA-seq dataset.
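To see why sample barcoding exposes only a fraction of multiplets, consider a doublet whose two cells come from the same sample: it carries a single barcode and looks like a singlet. The sketch below is a simplified illustration, not the paper's augmented binomial model. It assumes each cell of a doublet is drawn independently from the pooled samples in proportion to sample size, and the function name `same_sample_doublet_fraction` is hypothetical.

```python
from typing import Sequence


def same_sample_doublet_fraction(proportions: Sequence[float]) -> float:
    """Probability that both cells of a doublet come from the same sample.

    Assumption (not from the paper): the two cells are drawn independently,
    each from sample i with probability proportional to proportions[i].
    """
    total = sum(proportions)
    if total <= 0:
        raise ValueError("proportions must sum to a positive value")
    # Normalize so callers can pass raw cell counts as well as fractions.
    normalized = [p / total for p in proportions]
    # P(same sample) = sum over samples of p_i squared.
    return sum(p * p for p in normalized)


# Four equally sized samples pooled together.
equal = same_sample_doublet_fraction([1, 1, 1, 1])
print(f"undetectable doublet fraction, equal pooling: {equal:.3f}")

# One sample contributing 70% of the cells.
skewed = same_sample_doublet_fraction([7, 1, 1, 1])
print(f"undetectable doublet fraction, skewed pooling: {skewed:.3f}")
```

Under this assumption, pooling more samples in balanced proportions shrinks the share of doublets that barcoding cannot detect, while skewed pooling enlarges it.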

Publisher

Cold Spring Harbor Laboratory

Cited by 2 articles.

1. Artificial-cell-type aware cell-type classification in CITE-seq;Bioinformatics;2020-07-01

2. Artificial-Cell-Type Aware Cell Type Classification in CITE-seq;2020-02-02