Pan-cancer classification by regularized multi-task learning-Reference-Cited by-同舟云学术

Pan-cancer classification by regularized multi-task learning

Published:2021-12 Issue:1 Volume:11 Page:
ISSN:2045-2322
Container-title:Scientific Reports
language:en
Short-container-title:Sci Rep

Author:

Hossain Sk Md Mosaddek,Khatun Lutfunnesa,Ray Sumanta,Mukhopadhyay Anirban

Abstract

AbstractClassifying pan-cancer samples using gene expression patterns is a crucial challenge for the accurate diagnosis and treatment of cancer patients. Machine learning algorithms have been considered proven tools to perform downstream analysis and capture the deviations in gene expression patterns across diversified diseases. In our present work, we have developed PC-RMTL, a pan-cancer classification model using regularized multi-task learning (RMTL) for classifying 21 cancer types and adjacent normal samples using RNASeq data obtained from TCGA. PC-RMTL is observed to outperform when compared with five state-of-the-art classification algorithms, viz. SVM with the linear kernel (SVM-Lin), SVM with radial basis function kernel (SVM-RBF), random forest (RF), k-nearest neighbours (kNN), and decision trees (DT). The PC-RMTL achieves 96.07% accuracy and 95.80% MCC score for a completely unknown independent test set. The only method that appears as the real competitor is SVM-Lin, which nearly equalizes the accuracy in prediction of PC-RMTL but only when complete feature sets are provided for training; otherwise, PC-RMTL outperformed all other classification models. To the best of our knowledge, this is a significant improvement over all the existing works in pan-cancer classification as they have failed to classify many cancer types from one another reliably. We have also compared gene expression patterns of the top discriminating genes across the cancers and performed their functional enrichment analysis that uncovers several interesting facts in distinguishing pan-cancer samples.

Publisher

Springer Science and Business Media LLC

Subject

Multidisciplinary

Link

https://www.nature.com/articles/s41598-021-03554-8.pdf

Reference44 articles.

1. Kourou, K., Exarchos, T. P., Exarchos, K. P., Karamouzis, M. V. & Fotiadis, D. I. Machine learning applications in cancer prognosis and prediction. Comput. Struct. Biotechnol. J. 13, 8–17. https://doi.org/10.1016/j.csbj.2014.11.005 (2015).

2. Douglas, Y. The next decade of gene expression profiling. Drug Discov. https://www.ddw-online.com/the-next-decade-of-gene-expression-profiling-715-200508/ (2005).

3. Hossain, S. M. M., Khatun, L., Ray, S. & Mukhopadhyay, A. Identification of key immune regulatory genes in HIV-1 progression. Gene 792, 145735; https://doi.org/10.1016/j.gene.2021.145735 (2021).

4. Hossain, S. M. M., Halsana, A. A., Khatun, L., Ray, S. & Mukhopadhyay, A. Discovering key transcriptomic regulators in pancreatic ductal adenocarcinoma using Dirichlet process Gaussian mixture model. Sci. Rep. 11, 7853. https://doi.org/10.1038/s41598-021-87234-7 (2021).

5. Ray, S., Hossain, S. M. M., Khatun, L. & Mukhopadhyay, A. A comprehensive analysis on preservation patterns of gene co-expression networks during Alzheimer's disease progression. BMC Bioinform. 18, 579. https://doi.org/10.1186/s12859-017-1946-8 (2017).

Cited by 10 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Occlusion enhanced pan-cancer classification via deep learning;BMC Bioinformatics;2024-08-08

2. LASSO Based Analysis for Prediction of Prognostic Signature Genes Associated with Breast Cancer;2024-05-15

3. A platform-independent AI tumor lineage and site (ATLAS) classifier;Communications Biology;2024-03-13

4. SVM-DO: identification of tumor-discriminating mRNA signatures via support vectormachines supported by disease ontology;Turkish Journal of Biology;2023-12-28

5. An Autoencoder-based Deep Learning Approach for Classifying Colon and Rectum Adenocarcinoma;2023 14th International Conference on Computing Communication and Networking Technologies (ICCCNT);2023-07-06