A snapshot neural ensemble method for cancer-type prediction based on copy number variations-Reference-Cited by-同舟云学术

A snapshot neural ensemble method for cancer-type prediction based on copy number variations

Published:2019-11-30 Issue:19 Volume:32 Page:15281-15299
ISSN:0941-0643
Container-title:Neural Computing and Applications
language:en
Short-container-title:Neural Comput & Applic

Author:

Karim Md. Rezaul^ORCID,Rahman Ashiqur,Jares João Bosco,Decker Stefan,Beyan Oya

Abstract

AbstractAn accurate diagnosis and prognosis for cancer are specific to patients with particular cancer types and molecular traits, which needs to address carefully. The discovery of important biomarkers is becoming an important step toward understanding the molecular mechanisms of carcinogenesis in which genomics data and clinical outcomes need to be analyzed before making any clinical decision. Copy number variations (CNVs) are found to be associated with the risk of individual cancers and hence can be used to reveal genetic predispositions before cancer develops. In this paper, we collect the CNVs data about 8000 cancer patients covering 14 different cancer types from The Cancer Genome Atlas. Then, two different sparse representations of CNVs based on 578 oncogenes and 20,308 protein-coding genes, including genomic deletions and duplication across the samples, are prepared. Then, we train Conv-LSTM and convolutional autoencoder (CAE) networks using both representations and create snapshot models. While the Conv-LSTM can capture locally and globally important features, CAE can utilize unsupervised pretraining to initialize the weights in the subsequent convolutional layers against the sparsity. Model averaging ensemble (MAE) is then applied to combine the snapshot models in order to make a single prediction. Finally, we identify most significant CNVs biomarkers using guided-gradient class activation map plus (GradCAM++) and rank top genes for different cancer types. Results covering several experiments show fairly high prediction accuracies for the majority of cancer types. In particular, using protein-coding genes, Conv-LSTM and CAE networks can predict cancer types correctly at least 72.96% and 76.77% of the cases, respectively. Contrarily, using oncogenes gives moderately higher accuracies of 74.25% and 78.32%, whereas the snapshot model based on MAE shows overall 2.5% of accuracy improvement.

Publisher

Springer Science and Business Media LLC

Subject

Artificial Intelligence,Software

Link

http://link.springer.com/content/pdf/10.1007/s00521-019-04616-9.pdf

Reference54 articles.

1. Ahmad M, Alqarni MA, Khan AM, Hussain R, Mazzara M, Distefano S (2019) Segmented and non-segmented stacked denoising autoencoder for hyperspectral band reduction. Optik 180:370–378

2. AlShibli A, Mathkour H (2019) A shallow convolutional learning network for classification of cancers based on copy number variations. Sensors 19(19):4207

3. Blass BE (2017) Editorial for cancer virtual issue

4. Buckland PR (2003) Polymorphically duplicated genes: their relevance to phenotypic variation in humans. Ann Med 35(5):308–315

5. Calcagno DQ et al (2013) MYC, FBXW7 and TP53 copy number variation and expression in gastric cancer. BMC Gastroenterol 13(1):141

Cited by 20 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Using Copy Number Variation Data and Neural Networks to Predict Cancer Metastasis Origin Achieves High Area under the Curve Value with a Trade-Off in Precision;Current Issues in Molecular Biology;2024-08-01

2. Ensemble deep learning for Alzheimer’s disease characterization and estimation;Nature Mental Health;2024-05-03

3. Fatigue Performance of Double-Layer Beams with an Interlayer Based on the Method of Equivalent Fatigue Life;Journal of Materials in Civil Engineering;2024-02

4. DRI-UNet: dense residual-inception UNet for nuclei identification in microscopy cell images;Neural Computing and Applications;2023-06-22

5. Predictive modelling for molecular cancer profile classification using hybrid learning techniques;Soft Computing;2023-04-19