Differentiating Gliosarcoma from Glioblastoma: A Novel Approach Using PEACE and XGBoost to Deal with Datasets with Ultra-High Dimensional Confounders
Author:
Saki Amir1, Faghihi Usef1ORCID, Baldé Ismaila2ORCID
Affiliation:
1. Département de Mathématiques et d’Informatique, Université du Québec à Trois-Rivières, Trois-Rivières, QC G8Z 4M3, Canada 2. Département de Mathématiques et de Statistique, Faculté des Sciences, Université de Moncton, Moncton, NB E1A3E9, Canada
Abstract
In this study, we used a recently developed causal methodology, called Probabilistic Easy Variational Causal Effect (PEACE), to distinguish gliosarcoma (GSM) from glioblastoma (GBM). Our approach uses a causal metric which combines Probabilistic Easy Variational Causal Effect (PEACE) with the XGBoost, or eXtreme Gradient Boosting, algorithm. Unlike prior research, which often relied on statistical models to reduce dataset dimensions before causal analysis, our approach uses the complete dataset with PEACE and the XGBoost algorithm. PEACE provides a comprehensive measurement of direct causal effects, applicable to both continuous and discrete variables. Our method provides both positive and negative versions of PEACE together with their averages to calculate the positive and negative causal effects of the radiomic features on the variable representing the type of tumor (GSM or GBM). In our model, PEACE and its variations are equipped with a degree d which varies from 0 to 1 and it reflects the importance of the rarity and frequency of the events. By using PEACE with XGBoost, we achieved a detailed and nuanced understanding of the causal relationships within the dataset features, facilitating accurate differentiation between GSM and GBM. To assess the XGBoost model, we used cross-validation and obtained a mean accuracy of 83% and an average model MSE of 0.130. This performance is notable given the high number of columns and low number of rows (code on GitHub).
Reference25 articles.
1. Glioblastoma: Morphologic and molecular genetic diversity;Miller;Arch. Pathol. Lab. Med.,2007 2. Epidemiology of brain tumors;Ohgaki;Cancer Epidemiol. Modif. Factors,2009 3. Genomic landscape of gliosarcoma: Distinguishing features and targetable alterations;Zaki;Sci. Rep.,2021 4. Ammari, S., Sallé de Chou, R., Assi, T., Touat, M., Chouzenoux, E., Quillent, A., Limkin, E., Dercle, L., Hadchiti, J., and Elhaik, M. (2021). Machine-learning-based radiomics MRI model for survival prediction of recurrent glioblastomas treated with bevacizumab. Diagnostics, 11. 5. Yang, Y., Fan, W., Gu, T., Yu, L., Chen, H., Lv, Y., Liu, H., Wang, G., and Zhang, D. (2021). Radiomic features of multi-ROI and multi-phase MRI for the prediction of microvascular invasion in solitary hepatocellular carcinoma. Front. Oncol., 11.
Cited by
1 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献
|
|