Abstract
AbstractSepsis is a life-threatening condition characterized by an exaggerated immune response to pathogens, leading to organ damage and high mortality rates in the intensive care unit. Although deep learning has achieved impressive performance on prediction and classification tasks in medicine, it requires large amounts of data and lacks explainability, which hinder its application to sepsis diagnosis. We introduce a deep learning framework, called scCaT, which blends the capsulating architecture with Transformer to develop a sepsis diagnostic model using single-cell RNA sequencing data and transfers it to bulk RNA data. The capsulating architecture effectively groups genes into capsules based on biological functions, which provides explainability in encoding gene expressions. The Transformer serves as a decoder to classify sepsis patients and controls. Our model achieves high accuracy with an AUROC of 0.93 on the single-cell test set and an average AUROC of 0.98 on seven bulk RNA cohorts. Additionally, the capsules can recognize different cell types and distinguish sepsis from control samples based on their biological pathways. This study presents a novel approach for learning gene modules and transferring the model to other data types, offering potential benefits in diagnosing rare diseases with limited subjects.Author summaryDeep learning models used in disease diagnosis usually suffer from insufficient data for training and the lack of explainability, especially in rare diseases. These shortages hinder their application to sepsis diagnosis. Here we propose a diagnostic framework name scCaT(https://github.com/Kimxbzheng/CaT), which transfers knowledge learned from single-cell RNA-seq, for diseases with insufficient bulk data. The framework uses capsulating architecture to group genes into capsules and provide explainability to the deep learning model for sepsis diagnosis. ScCaT achieves robust and outstanding performance for sepsis diagnosis in both scRNA-seq and bulk RNA datasets. This architecture offers potential approaches in diagnosing rare diseases with limited subjects with explainability.
Publisher
Cold Spring Harbor Laboratory