Batch correction methods used in single cell RNA-sequencing analyses are often poorly calibrated-Reference-Cited by-同舟云学术

Batch correction methods used in single cell RNA-sequencing analyses are often poorly calibrated

Published:2024-03-21 Issue: Volume: Page:
ISSN:
Container-title:
language:
Short-container-title:

Author:

Emmanúel Antonsson Sindri^ORCID,Melsted Páll^ORCID

Abstract

AbstractAs the number of experiments that employ single-cell RNA-sequencing (scRNA-seq) grows it opens up the possibility of combining results across experiments or processing cells from the same experiment assayed in separate sequencing runs. The gain in the number of cells that can be compared comes at the cost of batch effects that may be present. Several methods have been proposed to combat this for scRNA-seq datasets.We compared seven widely used method used for batch correction of scRNA-seq datasets. We present a novel approach to measure the degree to which the methods alter the data in the process of batch correction, both at the fine scale comparing distances between cells as well as measuring effects observed across clusters of cells. We demonstrate that many of the published method are poorly calibrated in the sense that the process of correction creates measurable artifacts in the data.In particular, MNN, SCVI and LIGER performed poorly in our tests, often altering the data considerably. Batch correction with Combat, BBKNN and Seurat introduced artifacts that could be detected in our setup. However, we found that Harmony was the only method that consistently performed well, in all the testing methodology we present. Due to these result Harmony is the only method we can safely recommend using when performing batch correction of scRNA-seq data.

Publisher

Cold Spring Harbor Laboratory

Reference18 articles.

1. Integrating single-cell transcriptomic data across different conditions, technologies, and species

2. De novo assembly of bacterial transcriptomes from RNA-seq data

3. Batch effects in single-cell RNA-sequencing data are corrected by matching mutual nearest neighbors

4. Adjusting batch effects in microarray expression data using empirical Bayes methods

5. Fast, sensitive and accurate integration of single-cell data with Harmony

Cited by 3 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Evaluating batch correction methods for image-based cell profiling;Nature Communications;2024-08-02

2. A framework for quality control in quantitative proteomics;2024-04-13

3. Algorithms for a Commons Cell Atlas;2024-03-26