Assessing Versatile Machine Learning Models for Glioma Radiogenomic Studies across Hospitals

Published:2021-07-19 Issue:14 Volume:13 Page:3611
ISSN:2072-6694
Container-title:Cancers
language:en
Short-container-title:Cancers

Author:

Kawaguchi Risa K., Takahashi Masamichi, Miyake Mototaka, Kinoshita Manabu, Takahashi Satoshi, Ichimura Koichi, Hamamoto Ryuji, Narita Yoshitaka, Sese Jun

Abstract

Radiogenomics uses non-invasively obtained imaging data, such as magnetic resonance imaging (MRI), to predict critical biomarkers of patients. Developing an accurate machine learning (ML) technique for MRI requires data from hundreds of patients, which cannot be gathered from any single local hospital. Hence, a model universally applicable to multiple cohorts/hospitals is required. We applied various ML and image pre-processing procedures to a glioma dataset from The Cancer Imaging Archive (TCIA, n = 159). The models that showed a high level of accuracy in predicting glioblastoma or WHO Grade II and III glioma on the TCIA dataset were then tested on data from the National Cancer Center Hospital, Japan (NCC, n = 166) to determine whether they could maintain similar accuracy. We confirmed that our ML procedure achieved a level of accuracy (AUROC = 0.904) comparable to that shown previously by deep-learning methods using TCIA. However, when we directly applied the model to the NCC dataset, its AUROC dropped to 0.383. Introducing standardization and dimension reduction procedures before classification, without re-training, improved the prediction accuracy on the NCC dataset (AUROC = 0.804) without a loss in prediction accuracy for the TCIA dataset. Furthermore, we confirmed the same tendency in a model for IDH1/2 mutation prediction: with standardization and dimension reduction, it was also applicable to multiple hospitals. Our results demonstrate that overfitting may occur when the ML method providing the highest accuracy on a small training dataset is applied to different, heterogeneous datasets, and suggest a promising process for developing an ML method applicable to multiple cohorts.
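
The cross-hospital effect described in the abstract can be illustrated with a toy sketch. Everything here is an assumption made for illustration and not the authors' pipeline: the two-feature cohorts are invented numbers, "standardization" is taken to mean per-cohort z-scoring of each feature, the classifier is a simple mean-difference linear scorer, and the dimension reduction step is left out entirely. The sketch only shows how a feature recorded on a different scale at a second hospital can degrade AUROC, and how per-cohort standardization can restore it.

```python
from statistics import fmean, pstdev

def auroc(scores, labels):
    """AUROC as the fraction of (positive, negative) pairs ranked correctly; ties count 0.5."""
    pos = [s for s, y in zip(scores, labels) if y == 1]
    neg = [s for s, y in zip(scores, labels) if y == 0]
    wins = sum((p > n) + 0.5 * (p == n) for p in pos for n in neg)
    return wins / (len(pos) * len(neg))

def standardize(rows):
    """Z-score every feature column using the mean and SD of this cohort only."""
    columns = list(zip(*rows))
    stats = [(fmean(c), pstdev(c)) for c in columns]
    return [[(v - mu) / sd for v, (mu, sd) in zip(row, stats)] for row in rows]

def fit_mean_difference(rows, labels):
    """Hypothetical stand-in classifier: weight = class-mean difference per feature."""
    pos = [r for r, y in zip(rows, labels) if y == 1]
    neg = [r for r, y in zip(rows, labels) if y == 0]
    return [fmean(p) - fmean(n) for p, n in zip(zip(*pos), zip(*neg))]

def predict(weights, rows):
    """Linear score for each row."""
    return [sum(w * v for w, v in zip(weights, row)) for row in rows]

# Invented training cohort (hospital A): feature 1 is informative, feature 2 barely so.
train_x = [[3, 1.2], [4, 0.8], [5, 1.0], [1, 0.9], [2, 1.1], [0, 0.7]]
train_y = [1, 1, 1, 0, 0, 0]

# Invented external cohort (hospital B): feature 2 is stored on a much larger scale.
test_x = [[3, 700], [4, 1300], [5, 1000], [0, 1100], [1, 800], [2, 1200]]
test_y = [1, 1, 1, 0, 0, 0]

# Direct transfer: fit on raw A, score raw B.
w_raw = fit_mean_difference(train_x, train_y)
auroc_raw = auroc(predict(w_raw, test_x), test_y)

# With per-cohort standardization applied to both cohorts before fitting and scoring.
w_std = fit_mean_difference(standardize(train_x), train_y)
auroc_std = auroc(predict(w_std, standardize(test_x)), test_y)

print(f"AUROC on hospital B, raw features:          {auroc_raw:.3f}")
print(f"AUROC on hospital B, standardized features: {auroc_std:.3f}")
```

In this toy setup the raw model is dominated by the rescaled second feature at hospital B, while z-scoring within each cohort puts both features back on a comparable scale. Real radiomic features and the paper's actual procedures are considerably more involved.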

Publisher

MDPI AG

Subject

Cancer Research,Oncology

Link

https://www.mdpi.com/2072-6694/13/14/3611/pdf

References: 56 articles.

1. Radiomics in Brain Tumor: Image Assessment, Quantitative Feature Descriptors, and Machine-Learning Approaches

2. Conventional and advanced magnetic resonance imaging in patients with high-grade glioma

3. Computational Radiomics System to Decode the Radiographic Phenotype

4. Emerging Applications of Artificial Intelligence in Neuro-Oncology

5. Radiogenomics of Glioblastoma: Machine Learning–based Classification of Molecular Characteristics by Using Multiparametric and Multiregional MR Imaging Features

Cited by 13 articles.

1. Artificial intelligence-based MRI radiomics and radiogenomics in glioma;Cancer Imaging;2024-03-14

2. MRI-based model for accurate prediction of P53 gene status in gliomas;Electronic Research Archive;2024

3. Integrating Multi-Omics Data With EHR for Precision Medicine Using Advanced Artificial Intelligence;IEEE Reviews in Biomedical Engineering;2024

4. Application and constraints of AI in radiomics and radiogenomics (R-n-R) studies of neuro-oncology;Radiomics and Radiogenomics in Neuro-Oncology;2024

5. Predicting academic performance associated with physical fitness of primary school students using machine learning methods;Complementary Therapies in Clinical Practice;2023-05