A Novel Data Augmentation Method for Radiomics Analysis Using Image Perturbations
-
Published:2024-05-06
Issue:
Volume:
Page:
-
ISSN:2948-2933
-
Container-title:Journal of Imaging Informatics in Medicine
-
language:en
-
Short-container-title:J Digit Imaging. Inform. med.
Author:
Lo Iacono FORCID, Maragna R.ORCID, Pontone G.ORCID, Corino V. D. A.ORCID
Abstract
AbstractRadiomics extracts hundreds of features from medical images to quantitively characterize a region of interest (ROI). When applying radiomics, imbalanced or small dataset issues are commonly addressed using under or over-sampling, the latter being applied directly to the extracted features. Aim of this study is to propose a novel balancing and data augmentation technique by applying perturbations (erosion, dilation, contour randomization) to the ROI in cardiac computed tomography images. From the perturbed ROIs, radiomic features are extracted, thus creating additional samples. This approach was tested addressing the clinical problem of distinguishing cardiac amyloidosis (CA) from aortic stenosis (AS) and hypertrophic cardiomyopathy (HCM). Twenty-one CA, thirty-two AS and twenty-one HCM patients were included in the study. From each original and perturbed ROI, 107 radiomic features were extracted. The CA-AS dataset was balanced using the perturbation-based method along with random over-sampling, adaptive synthetic (ADASYN) and the synthetic minority oversampling technique (SMOTE). The same methods were tested to perform data augmentation dealing with CA and HCM. Features were submitted to robustness, redundancy, and relevance analysis testing five feature selection methods (p-value, least absolute shrinkage and selection operator (LASSO), semi-supervised LASSO, principal component analysis (PCA), semi-supervised PCA). Support vector machine performed the classification tasks, and its performance were evaluated by means of a 10-fold cross-validation. The perturbation-based approach provided the best performances in terms of f1 score and balanced accuracy in both CA-AS (f1 score: 80%, AUC: 0.91) and CA-HCM (f1 score: 86%, AUC: 0.92) classifications. These results suggest that ROI perturbations represent a powerful approach to address both data balancing and augmentation issues.
Funder
Politecnico di Milano
Publisher
Springer Science and Business Media LLC
Reference39 articles.
1. La Greca Saint-Esteven, A., Vuong, D., Tschanz, F., van Timmeren, J.E., Dal Bello, R., Waller, V., Pruschy, M., Guckenberger, M., Tanadini-Lang, S.: Systematic Review on the Association of Radiomics with Tumor Biological Endpoints. Cancers. 13, 3015 (2021). https://doi.org/10.3390/cancers13123015. 2. Corino, V.D.A., Montin, E., Messina, A., Casali, P.G., Gronchi, A., Marchianò, A., Mainardi, L.T.: Radiomic analysis of soft tissues sarcomas can distinguish intermediate from high-grade lesions. Journal of Magnetic Resonance Imaging. 47, 829–840 (2018). https://doi.org/10.1002/jmri.25791. 3. Kothari, G.: Role of radiomics in predicting immunotherapy response. Journal of Medical Imaging and Radiation Oncology. 66, 575–591 (2022). https://doi.org/10.1111/1754-9485.13426. 4. Bologna, M., Calareso, G., Resteghini, C., Sdao, S., Montin, E., Corino, V., Mainardi, L., Licitra, L., Bossi, P.: Relevance of apparent diffusion coefficient features for a radiomics-based prediction of response to induction chemotherapy in sinonasal cancer. NMR in Biomedicine. 35, e4265 (2022). https://doi.org/10.1002/nbm.4265. 5. Zhang, B., Ouyang, F., Gu, D., Dong, Y., Zhang, L., Mo, X., Huang, W., Zhang, S.: Advanced nasopharyngeal carcinoma: pre-treatment prediction of progression based on multi-parametric MRI radiomics. Oncotarget. 8, 72457–72465 (2017). https://doi.org/10.18632/oncotarget.19799.
|
|