Handling Imbalanced Class Problem of Measles InfectionRisk Prediction Model
-
Published:2019-10-30
Issue:1
Volume:9
Page:3431-3435
-
ISSN:2249-8958
-
Container-title:International Journal of Engineering and Advanced Technology
-
language:
-
Short-container-title:IJEAT
Author:
Ahmad* Wan MuhamadTaufik Wan, ,Ghani Nur Laila Ab,Drus Sulfeeza Mohd, ,
Abstract
Measles is an emerging infectious disease with increasing number of reported cases. It is a vaccine-preventable disease;thus, it is common to have imbalanced class problem in the dataset. This study aims to resolve the imbalanced class problem for the prediction of measles infection risk and to compare the predictive results on a balanced dataset based on three machine learningtechniques. The data that was utilized in this study contained 37,884 records of suspected measles casesthat were highly imbalanced towards negative measles cases. The Synthetic Minority Over-Sampling Technique (SMOTE) was performed to balance thedistribution of the target attribute. The balanced dataset was then modelled using logistic regression, decision tree and Naïve Bayes. The predicted results indicated that logistic regression executed on the balanced dataset by SMOTE has the highest and most accurateclassification with 94.5% overall accuracy, 93.9% true positive rate, 5.8% false positive rate and 5.1% false negative rate. Therefore, SMOTE and other over-sampling approaches may be applicable to overcome imbalanced class issues in the medical dataset.
Publisher
Blue Eyes Intelligence Engineering and Sciences Engineering and Sciences Publication - BEIESP
Subject
Computer Science Applications,General Engineering,Environmental Engineering
Cited by
1 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献