Single-cell gene set scoring with nearest neighbor graph smoothed data (gssnng)-Reference-Cited by-同舟云学术

Single-cell gene set scoring with nearest neighbor graph smoothed data (gssnng)

Published:2023-01-01 Issue:1 Volume:3 Page:
ISSN:2635-0041
Container-title:Bioinformatics Advances
language:en
Short-container-title:

Author:

Gibbs David L¹,Strasser Michael K²,Huang Sui²

Affiliation:

1. Shmulevich Lab, Institute for Systems Biology , Seattle, WA 98106, United States

2. Huang Lab, Institute for Systems Biology , Seattle, WA 98106, United States

Abstract

Abstract Summary Gene set scoring (or enrichment) is a common dimension reduction task in bioinformatics that can be focused on the differences between groups or at the single sample level. Gene sets can represent biological functions, molecular pathways, cell identities, and more. Gene set scores are context dependent values that are useful for interpreting biological changes following experiments or perturbations. Single sample scoring produces a set of scores, one for each member of a group, which can be analyzed with statistical models that can include additional clinically important factors such as gender or age. However, the sparsity and technical noise of single-cell expression measures create difficulties for these methods, which were originally designed for bulk expression profiling (microarrays, RNAseq). This can be greatly remedied by first applying a smoothing transformation that shares gene measure information within transcriptomic neighborhoods. In this work, we use the nearest neighbor graph of cells for matrix smoothing to produce high quality gene set scores on a per-cell, per-group, level which is useful for visualization and statistical analysis. Availability and implementation The gssnng software is available using the python package index (PyPI) and works with Scanpy AnnData objects. It can be installed using “pip install gssnng.” More information and demo notebooks: see https://github.com/IlyaLab/gssnng.

Funder

Cancer Research UK Grand Challenge

Tlsty Lab

McGill University Thoracic and Upper GI Cancer Research Laboratories

Advanced Genomic Technologies Laboratory

Publisher

Oxford University Press (OUP)

Subject

Computer Science Applications,Genetics,Molecular Biology,Structural Biology

Link

https://academic.oup.com/bioinformaticsadvances/advance-article-pdf/doi/10.1093/bioadv/vbad150/52214972/vbad150.pdf

Reference29 articles.