The Machine Learning Model for Distinguishing Pathological Subtypes of Non-Small Cell Lung Cancer-Reference-Cited by-同舟云学术

The Machine Learning Model for Distinguishing Pathological Subtypes of Non-Small Cell Lung Cancer

Published:2022-05-26 Issue: Volume:12 Page:
ISSN:2234-943X
Container-title:Frontiers in Oncology
language:
Short-container-title:Front. Oncol.

Author:

Zhao Hongyue,Su Yexin,Wang Mengjiao,Lyu Zhehao,Xu Peng,Jiao Yuying,Zhang Linhan,Han Wei,Tian Lin,Fu Peng

Abstract

PurposeMachine learning models were developed and validated to identify lung adenocarcinoma (LUAD) and lung squamous cell carcinoma (LUSC) using clinical factors, laboratory metrics, and 2-deoxy-2[18F]fluoro-D-glucose ([18F]F-FDG) positron emission tomography (PET)/computed tomography (CT) radiomic features.MethodsOne hundred and twenty non-small cell lung cancer (NSCLC) patients (62 LUAD and 58 LUSC) were analyzed retrospectively and randomized into a training group (n = 85) and validation group (n = 35). A total of 99 feature parameters—four clinical factors, four laboratory indicators, and 91 [18F]F-FDG PET/CT radiomic features—were used for data analysis and model construction. The Boruta algorithm was used to screen the features. The retained minimum optimal feature subset was input into ten machine learning to construct a classifier for distinguishing between LUAD and LUSC. Univariate and multivariate analyses were used to identify the independent risk factors of the NSCLC subtype and constructed the Clinical model. Finally, the area under the receiver operating characteristic curve (AUC) values, sensitivity, specificity, and accuracy (ACC) was used to validate the machine learning model with the best performance effect and Clinical model in the validation group, and the DeLong test was used to compare the model performance.ResultsBoruta algorithm selected the optimal subset consisting of 13 features, including two clinical features, two laboratory indicators, and nine PEF/CT radiomic features. The Random Forest (RF) model and Support Vector Machine (SVM) model in the training group showed the best performance. Gender (P=0.018) and smoking status (P=0.011) construct the Clinical model. In the validation group, the SVM model (AUC: 0.876, ACC: 0.800) and RF model (AUC: 0.863, ACC: 0.800) performed well, while Clinical model (AUC:0.712, ACC: 0.686) performed moderately. There was no significant difference between the RF and Clinical models, but the SVM model was significantly better than the Clinical model. ConclusionsThe proposed SVM and RF models successfully identified LUAD and LUSC. The results indicate that the proposed model is an accurate and noninvasive predictive tool that can assist clinical decision-making, especially for patients who cannot have biopsies or where a biopsy fails.

Publisher

Frontiers Media SA

Subject

Cancer Research,Oncology

Reference32 articles.

1. Global Cancer Statistics 2020: GLOBOCAN Estimates of Incidence and Mortality Worldwide for 36 Cancers in 185 Countries;Sung;CA Cancer J Clin,2021

2. Mutation Incidence and Coincidence in non Small-Cell Lung Cancer: Meta-Analyses by Ethnicity and Histology (Mutmap);Dearden;Ann Oncol,2013

3. Alflutinib (AST2818), Primarily Metabolized by CYP3A4, Is a Potent CYP3A4 Inducer;Liu;Acta Pharmacol Sin,2020

4. Intra-Tumoural Heterogeneity Characterization Through Texture and Colour Analysis for Differentiation of Non-Small Cell Lung Carcinoma Subtypes;Ma;Phys Med Biol,2018

5. Comparisons of the Clinicopathological Features and Survival Outcomes Between Lung Cancer Patients With Adenocarcinoma and Squamous Cell Carcinoma;Fukui;Gen Thorac Cardiovasc Surg,2015

Cited by 13 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Enhancing Lung Cancer Survival Prediction: 3D CNN Analysis of CT Images Using Novel GTV1-SliceNum Feature and PEN-BCE Loss Function;Diagnostics;2024-06-20

2. Preoperative prediction of lymph node metastasis in colorectal cancer using ¹⁸F‐FDG PET/CT peritumoral radiomics analysis;Medical Physics;2024-05-27

3. Advancing NSCLC pathological subtype prediction with interpretable machine learning: a comprehensive radiomics-based approach;Frontiers in Medicine;2024-05-22

4. Machine Learning in Diagnosis and Prognosis of Lung Cancer by PET-CT;Cancer Management and Research;2024-04

5. Using tumor habitat-derived radiomic analysis during pretreatment 18F-FDG PET for predicting KRAS/NRAS/BRAF mutations in colorectal cancer;Cancer Imaging;2024-02-12