Affiliation:
1. School of Computer Science and Engineering at the Sun Yat-sen University, China
2. Sun Yat-sen Memorial Hospital at the Sun Yat-sen University, China
3. School of Computer Science and Engineering and the National Super Computer Center at Guangzhou, Sun Yat-sen University, China
Abstract
Abstract
In single cell analyses, cell types are conventionally identified based on expressions of known marker genes, whose identifications are time-consuming and irreproducible. To solve this issue, many supervised approaches have been developed to identify cell types based on the rapid accumulation of public datasets. However, these approaches are sensitive to batch effects or biological variations since the data distributions are different in cross-platforms or species predictions. In this study, we developed scAdapt, a virtual adversarial domain adaptation network, to transfer cell labels between datasets with batch effects. scAdapt used both the labeled source and unlabeled target data to train an enhanced classifier and aligned the labeled source centroids and pseudo-labeled target centroids to generate a joint embedding. The scAdapt was demonstrated to outperform existing methods for classification in simulated, cross-platforms, cross-species, spatial transcriptomic and COVID-19 immune datasets. Further quantitative evaluations and visualizations for the aligned embeddings confirm the superiority in cell mixing and the ability to preserve discriminative cluster structure present in the original datasets.
Funder
Introducing Innovative and Entrepreneurial Teams
Natural Science Foundation of Guangdong, China
Guangdong Frontier & Key Tech Innovation Program
National Natural Science Foundation of China
National Key R&D Program of China
Publisher
Oxford University Press (OUP)
Subject
Molecular Biology,Information Systems
Cited by
16 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献