Optimal distance metrics for single-cell RNA-seq populations-Reference-Cited by-同舟云学术

Optimal distance metrics for single-cell RNA-seq populations

Published:2023-12-27 Issue: Volume: Page:
ISSN:
Container-title:
language:
Short-container-title:

Author:

Ji Yuge^ORCID,Green Tessa D.^ORCID,Peidli Stefan^ORCID,Bahrami Mojtaba^ORCID,Liu Meiqi,Zappia Luke^ORCID,Hrovatin Karin^ORCID,Sander Chris^ORCID,Theis Fabian J.^ORCID

Abstract

AbstractIn single-cell data workflows and modeling, distance metrics are commonly used in loss functions, model evaluation, and subpopulation analysis. However, these metrics behave differently depending on the source of variation, conditions and subpopulations in single-cell expression profiles due to data sparsity and high dimensionality. Thus, the metrics used for downstream tasks in this domain should be carefully selected. We establish a set of benchmarks with three evaluation measures, capturing desirable facets of absolute and relative distance behavior. Based on seven datasets using perturbation as ground truth, we evaluated 16 distance metrics applied to scRNA-seq data and demonstrated their application to three use cases. We find that linear metrics such as mean squared error (MSE) performed best across our three evaluation criteria. Therefore, we recommend the use of MSE for comparing single-cell RNA-seq populations and evaluating gene expression prediction models.

Publisher

Cold Spring Harbor Laboratory

Reference52 articles.

1. scGen predicts single-cell perturbation responses;Nat. Methods,2019

2. An empirical Bayes method for differential expression analysis of single cells with deep generative models;Proc. Natl. Acad. Sci. U. S. A,2023

3. Deep generative modeling for single-cell transcriptomics;Nat. Methods,2018

4. Batch effects in single-cell RNA-sequencing data are corrected by matching mutual nearest neighbors

5. Efficient integration of heterogeneous single-cell transcriptomes using Scanorama;Nat. Biotechnol,2019

Cited by 3 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Optimal transport for single-cell and spatial omics;Nature Reviews Methods Primers;2024-08-14

2. Pertpy: an end-to-end framework for perturbation analysis;2024-08-07

3. Transcriptome-wide characterization of genetic perturbations;2024-07-03