Author:
Wang Jianlin,Dai Xuebing,Luo Huimin,Yan Chaokun,Zhang Ge,Luo Junwei
Abstract
The Pan-Cancer Atlas consists of original sequencing data from various sources, provides the opportunity to perform systematic studies on the commonalities and differences between diverse cancers. The analysis for the pan-cancer dataset could help researchers to identify the key factors that could trigger cancer. In this paper, we present a novel pan-cancer classification method, referred to MI_DenseNetCAM, to identify a set of genes that can differentiate all tumor types accurately. First, the Mutual Information (MI) was utilized to eliminate noise and redundancy from the pan-cancer datasets. Then, the gene data was further converted to 2D images. Next, the DenseNet model was adopted as a classifier and the Guided Grad-CAM algorithm was applied to identify the key genes. Extensive experimental results on the public RNA-seq data sets with 33 different tumor types show that our method outperforms the other state-of-the-art classification methods. Moreover, gene analysis further demonstrated that the genes selected by our method were related to the corresponding tumor types.
Funder
National Natural Science Foundation of China
China Postdoctoral Science Foundation
Subject
Genetics(clinical),Genetics,Molecular Medicine
Reference38 articles.
1. Distinct types of diffuse large b-cell lymphoma identified by gene expression profiling;Alizadeh;Nature,2000
2. A deep learning approach for cancer detection and relevant gene identification,;Danaee,2017
3. Imagenet: a large-scale hierarchical image database,;Deng,2009
4. Molecular classification of cancer: class discovery and class prediction by gene expression monitoring;Golub;science,1999
5. A probabilistic interpretation of precision, recall and f-score, with implication for evaluation,;Goutte,2005
Cited by
4 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献