A Comparison of Resampling Techniques for Medical Data Using Machine Learning-Reference-Cited by-同舟云学术

A Comparison of Resampling Techniques for Medical Data Using Machine Learning

Published:2020-03 Issue:01 Volume:19 Page:2040016
ISSN:0219-6492
Container-title:Journal of Information & Knowledge Management
language:en
Short-container-title:J. Info. Know. Mgmt.

Author:

Alahmari Fahad¹

Affiliation:

1. College of Computer Science, King Khalid University, Saudi Arabia

Abstract

Data imbalance with respect to the class labels has been recognised as a challenging problem for machine learning techniques as it has a direct impact on the classification model’s performance. In an imbalanced dataset, most of the instances belong to one class, while far fewer instances are associated with the remaining classes. Most of the machine learning algorithms tend to favour the majority class and ignore the minority classes leading to classification models being generated that cannot be generalised. This paper investigates the problem of class imbalance for a medical application related to autism spectrum disorder (ASD) screening to identify the ideal data resampling method that can stabilise classification performance. To achieve the aim, experimental analyses to measure the performance of different oversampling and under-sampling techniques have been conducted on a real imbalanced ASD dataset related to adults. The results produced by multiple classifiers on the considered datasets showed superiority in terms of specificity, sensitivity, and precision, among others, when adopting oversampling techniques in the pre-processing phase.

Publisher

World Scientific Pub Co Pte Lt

Subject

Library and Information Sciences,Computer Networks and Communications,Computer Science Applications

Link

https://www.worldscientific.com/doi/pdf/10.1142/S021964922040016X

Reference28 articles.

1. A Machine Learning Strategy for Autism Screening in Toddlers

2. Fuzzy Data Mining for Autism Classification of Children

3. Toward Brief “Red Flags” for Autism Screening: The Short Autism Spectrum Quotient and the Short Quantitative Checklist in 1,000 Cases and 3,000 Controls

Cited by 22 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Evaluation and benchmarking of hybrid machine learning models for autism spectrum disorder diagnosis using a 2-tuple linguistic neutrosophic fuzzy sets-based decision-making model;Neural Computing and Applications;2024-07-20

2. Artificial intelligence-driven prediction system for efficient management of Parlatoria Blanchardi in date palms;Multimedia Tools and Applications;2024-06-20

3. Fuzzy Evaluation and Benchmarking Framework for Robust Machine Learning Model in Real-Time Autism Triage Applications;International Journal of Computational Intelligence Systems;2024-06-17

4. Leveraging Sampling Schemes on Skewed Class Distribution to Enhance Male Fertility Detection with Ensemble AI Learners;International Journal of Pattern Recognition and Artificial Intelligence;2024-02

5. Assessment of Wind Shear Severity in Airport Runway Vicinity using Interpretable TabNet approach and Doppler LiDAR Data;Applied Artificial Intelligence;2024-01-10