Abstract
Prioritization or ranking of different cell types in a single-cell RNA sequencing (scRNA-seq) framework can be performed in a variety of ways, some of these include: i) obtaining an indication of the proportion of cell types between the different conditions under study, ii) counting the number of differentially expressed genes (DEGs) between cell types and conditions in the experiment or, iii) prioritizing cell types based on prior knowledge about the conditions under study (i.e., a specific disease). These methods have drawbacks and limitations thus novel methods for improving cell ranking are required. Here we present a novel methodology that exploits prior knowledge in combination with expert-user information to accentuate cell types from a scRNA-seq analysis that yield the most biologically meaningful results with respect to a disease under study. Our methodology allows for ranking and prioritization of cell types based on how well their expression profiles relate to the molecular mechanisms and drugs associated with a disease. Molecular mechanisms, as well as drugs, are incorporated as prior knowledge in a standardized, structured manner. Cell types are then ranked/prioritized based on how well results from data-driven analysis of scRNA-seq data match the predefined prior knowledge. In additional cell-cell communication perturbations between disease and control networks are used to further prioritize/rank cell types. Our methodology has substantial advantages to more traditional cell ranking techniques and provides an informative complementary methodology that utilizes prior knowledge in a rapid and automated manner, that has previously not been attempted by other studies. The current methodology is also implemented as an R package entitled Single Cell Ranking Analysis Toolkit (scRANK) and is available for download and installation via GitHub (https://github.com/aoulas/scRANK).
Funder
HORIZON EUROPE Framework Programme
Publisher
Public Library of Science (PLoS)