Implications of resampling data to address the class imbalance problem (IRCIP): an evaluation of impact on performance between classification algorithms in medical data-Reference-Cited by-同舟云学术

Implications of resampling data to address the class imbalance problem (IRCIP): an evaluation of impact on performance between classification algorithms in medical data

Published:2023-04-06 Issue:2 Volume:6 Page:
ISSN:2574-2531
Container-title:JAMIA Open
language:en
Short-container-title:

Author:

Welvaars Koen¹^ORCID,Oosterhoff Jacobien H F²,van den Bekerom Michel P J³⁴,Doornberg Job N⁵,van Haarst Ernst P⁶,van der Zee J A,van Andel G A,Lagerveld B W,Hovius M C,Kauer P C,Boevé L M S,van der Kuit A,Mallee W,Poolman R,

Affiliation:

1. Data Science Team, OLVG , Amsterdam, The Netherlands

2. Department of Engineering Systems & Services, Faculty Technology Policy and Management, Delft University of Technology , Delft, The Netherlands

3. Department of Orthopaedic Surgery, OLVG , Amsterdam, the Netherlands

4. Faculty of Behavioural and Movement Sciences, Vrije Universiteit , Amsterdam, the Netherlands

5. Department of Orthopaedic Surgery, UMCG , Groningen, the Netherlands

6. Department of Urology, OLVG , Amsterdam, the Netherlands

Abstract

Abstract Objective When correcting for the “class imbalance” problem in medical data, the effects of resampling applied on classifier algorithms remain unclear. We examined the effect on performance over several combinations of classifiers and resampling ratios. Materials and Methods Multiple classification algorithms were trained on 7 resampled datasets: no correction, random undersampling, 4 ratios of Synthetic Minority Oversampling Technique (SMOTE), and random oversampling with the Adaptive Synthetic algorithm (ADASYN). Performance was evaluated in Area Under the Curve (AUC), precision, recall, Brier score, and calibration metrics. A case study on prediction modeling for 30-day unplanned readmissions in previously admitted Urology patients was presented. Results For most algorithms, using resampled data showed a significant increase in AUC and precision, ranging from 0.74 (CI: 0.69–0.79) to 0.93 (CI: 0.92–0.94), and 0.35 (CI: 0.12–0.58) to 0.86 (CI: 0.81–0.92) respectively. All classification algorithms showed significant increases in recall, and significant decreases in Brier score with distorted calibration overestimating positives. Discussion Imbalance correction resulted in an overall improved performance, yet poorly calibrated models. There can still be clinical utility due to a strong discriminating performance, specifically when predicting only low and high risk cases is clinically more relevant. Conclusion Resampling data resulted in increased performances in classification algorithms, yet produced an overestimation of positive predictions. Based on the findings from our case study, a thoughtful predefinition of the clinical prediction task may guide the use of resampling techniques in future studies aiming to improve clinical decision support tools.

Funder

OLVG Urology Consortium

Publisher

Oxford University Press (OUP)

Subject

Health Informatics

Link

https://academic.oup.com/jamiaopen/article-pdf/6/2/ooad033/50497726/ooad033.pdf

Reference20 articles.

1. The class imbalance problem;Megahed;Nat Methods,2021

2. Learning from Imbalanced Data Sets

3. An empirical evaluation of sampling methods for the classification of imbalanced data;Kim;PLoS One,2022