A deep learning model to classify neoplastic state and tissue origin from transcriptomic data-Reference-Cited by-同舟云学术

A deep learning model to classify neoplastic state and tissue origin from transcriptomic data

Published:2022-06-11 Issue:1 Volume:12 Page:
ISSN:2045-2322
Container-title:Scientific Reports
language:en
Short-container-title:Sci Rep

Author:

Hong James,Hachem Laureen D.,Fehlings Michael G.

Abstract

AbstractApplication of deep learning methods to transcriptomic data has the potential to enhance the accuracy and efficiency of tissue classification and cell state identification. Herein, we developed a multitask deep learning model for tissue classification combining publicly available whole transcriptomic (RNA-seq) datasets of non-neoplastic, neoplastic and peri-neoplastic tissue to classify disease state, tissue origin and neoplastic subclass. RNA-seq data from a total of 10,116 patient samples processed through a common pipeline were used for model training and validation. The model achieved 99% accuracy for disease state classification (ROC-AUC of 0.98) and 97% accuracy for tissue origin (ROC-AUC of 0.99). Moreover, the model achieved an accuracy of 92% (ROC-AUC 0.95) for neoplastic subclassification. This is the first multitask deep learning algorithm developed for tissue classification employing a uniform pipeline analysis of transcriptomic data with multiple tissue classifiers. This model serves as a framework for incorporating large transcriptomic datasets across conditions to facilitate clinical diagnosis and cell-based treatment strategies.

Publisher

Springer Science and Business Media LLC

Subject

Multidisciplinary

Link

https://www.nature.com/articles/s41598-022-13665-5.pdf

Reference31 articles.

1. Cheung, C. C., Martin, B. R. & Asa, S. L. Defining diagnostic tissue in the era of personalized medicine. CMAJ 185, 135–139. https://doi.org/10.1503/cmaj.120565 (2013).

2. Davidson, E. H. & Erwin, D. H. Gene regulatory networks and the evolution of animal body plans. Science 311, 796–800. https://doi.org/10.1126/science.1113832 (2006).

3. Courtiol, P. et al. Deep learning-based classification of mesothelioma improves prediction of patient outcome. Nat. Med. 25, 1519–1525. https://doi.org/10.1038/s41591-019-0583-3 (2019).

4. Xu, Q. et al. Pan-cancer transcriptome analysis reveals a gene expression signature for the identification of tumor tissue origin. Mod. Pathol. 29, 546–556. https://doi.org/10.1038/modpathol.2016.60 (2016).

5. Burke, E. E. et al. Dissecting transcriptomic signatures of neuronal differentiation and maturation using iPSCs. Nat. Commun. 11, 462. https://doi.org/10.1038/s41467-019-14266-z (2020).

Cited by 7 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. A variational autoencoder trained with priors from canonical pathways increases the interpretability of transcriptome data;PLOS Computational Biology;2024-07-03

2. New techniques to identify the tissue of origin for cancer of unknown primary in the era of precision medicine: progress and challenges;Briefings in Bioinformatics;2024-01-22

3. Multi-omics based artificial intelligence for cancer research;Advances in Cancer Research;2024

4. The practical utility of AI-assisted molecular profiling in the diagnosis and management of cancer of unknown primary: an updated review;Virchows Archiv;2023-11-24

5. Machine learning for pan-cancer classification based on RNA sequencing data;Frontiers in Molecular Biosciences;2023-11-10