Abstract
AbstractGene-by-gene differential expression analysis is a popular supervised learning method for analyzing single-cell RNA sequencing (scRNA-seq) data. However, the large number of cells in scRNA-seq studies often results in numerous differentially expressed genes with extremely small p-values but minimal effect sizes, complicating interpretation. To address this challenge, we developed a method called Supervised Deep Learning with gene ANnotation (SDAN). SDAN integrates gene annotation and gene expression data using a graph neural network to identify gene sets to classify cells, and then the corresponding individuals. We demonstrated the usage of SDAN by identifying gene sets associated with severe COVID-19, dementia, and cancer patients’ responses to immunotherapy.
Publisher
Cold Spring Harbor Laboratory
Reference32 articles.
1. Single-cell rna-seq denoising using a deep count autoencoder;Nature communications,2019
2. Data denoising with transfer learning in single-cell transcriptomics;Nature methods,2019
3. Data denoising and post-denoising corrections in single cell rna sequencing;Statistical Science,2020
4. Clustering single-cell rna-seq data with a model-based deep learning approach;Nature Machine Intelligence,2019
5. Deep learning enables accurate clustering with batch effect removal in single-cell rna-seq analysis;Nature communications,2020