Abstract
AbstractAn important quest for the life science community is to deliver a complete annotation of the human building-blocks of life, the genes and the proteins. Here, we report on a genome-wide effort to annotate all protein-coding genes based on single cell transcriptomics data representing all major tissues and organs in the human body, integrated with data from bulk transcriptomics and antibody-based tissue profiling. Altogether, 25 tissues have been analyzed with single cell transcriptomics resulting in genome-wide expression in 444 single cell types using a strategy involving pooling data from individual cells to obtain genome-wide expression profiles of individual cell type. We introduce a new genome-wide classification tool based on clustering of similar expression profiles across single cell types, which can be visualized using dimensional reduction maps (UMAP). The clustering classification is integrated with a new “tau” score classification for all protein-coding genes, resulting in a measure of single cell specificity across all cell types for all individual genes. The analysis has allowed us to annotate all human protein-coding genes with regards to function and spatial distribution across individual cell types across all major tissues and organs in the human body. A new version of the open access Human Protein Atlas (www.proteinatlas.org) has been launched to enable researchers to explore the new genome-wide annotation on an individual gene level.
Publisher
Cold Spring Harbor Laboratory
Cited by
1 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献