Comparative Analysis of ADASYN-SVM and SMOTE-SVM Methods on the Detection of Type 2 Diabetes Mellitus-Reference-Cited by-同舟云学术

Comparative Analysis of ADASYN-SVM and SMOTE-SVM Methods on the Detection of Type 2 Diabetes Mellitus

Published:2021-11-30 Issue:2 Volume:8 Page:276-282
ISSN:2460-0040
Container-title:Scientific Journal of Informatics
language:
Short-container-title:SJI

Author:

Ramadhan Nur Ghaniaviyanto

Abstract

Most people with diabetes in the world are type 2. We can detect diabetes early to prevent things that are not desirable by checking sugar and insulin levels with the doctor. In addition to using this method, people with diabetes can also be grouped based on data from diabetes examination results. However, most of the data on health examination results have several parameters that are difficult for the public to understand. These problems can be done by means of automatic classification. In addition to these problems, there is another problem in the form of an unbalanced amount of data for diabetics and non-diabetics. This problem can be done by balancing the amount of data using the model to increase the ratio of the amount of data that is small or decrease the ratio of the amount of data that is too much. Purpose: This study aims to detect type 2 diabetes mellitus using the SVM classification model and analyze the results of the comparison using the SMOTE and ADASYN data balancing technique which is the best. Methods/Study design/approach: The research method starts from collecting the diabetes dataset, then the dataset cleaning process is carried out whether there is a null value or not. After applying two oversampling methods to analyze which method is the most appropriate. After the oversampling technique was carried out, data classification was carried out using a support vector machine model to see the accuracy results. Result/Findings: The results obtained by the ADASYN-SVM method are superior to SMOTE-SVM. The ADASYNSVM method has an accuracy of 87.3%, while the SMOTE-SVM has an accuracy of 85.4%. Novelty/Originality/Value: The data used in this study came from the Karya Medika clinic, Indonesia which contains parameters related to type 2 diabetes.

Publisher

Universitas Negeri Semarang

Link

https://journal.unnes.ac.id/nju/index.php/sji/article/viewFile/32484/pdf

Cited by 15 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. PIF dataset: a comprehensive dataset of physiological and inertial features for recognition of human activities;Multimedia Tools and Applications;2024-05-02

2. Prediction of dementia based on older adults’ sleep disturbances using machine learning;Computers in Biology and Medicine;2024-03

3. Deep ensemble learning approach for lower limb movement recognition from multichannel sEMG signals;Neural Computing and Applications;2024-02-17

4. An Imbalanced Sequence Feature Extraction Approach for the Detection of LTE-R Cells with Degraded Communication Performance;Future Internet;2024-01-16

5. Chronic Diseases Prediction Using Machine Learning With Data Preprocessing Handling: A Critical Review;IEEE Access;2024