An Oversampling Mechanism for Multimajority Datasets using SMOTE and Darwinian Particle Swarm Optimisation
-
Published:2023-03-10
Issue:2
Volume:11
Page:143-153
-
ISSN:2321-8169
-
Container-title:International Journal on Recent and Innovation Trends in Computing and Communication
-
language:
-
Short-container-title:IJRITCC
Author:
Mary Mathew Rose,Gunasundari R.
Abstract
Data skewness continues to be one of the leading factors which adversely impacts the machine learning algorithms performance. An approach to reduce this negative effect of the data variance is to pre-process the former dataset with data level resampling strategies. Resampling strategies have been seen in two forms, oversampling and undersampling. An oversampling strategy is proposed in this article for tackling multiclass imbalanced datasets. This proposed approach optimises the state-of-the-art oversampling technique SMOTE with the Darwinian Particle Swarm Optimization technique. This proposed method DOSMOTE generates synthetic optimised samples for balancing the datasets. This strategy will be more effective on multimajority datasets. An experimental study is performed on peculiar multimajority datasets to measure the effectiveness of the proposed approach. As a result, the proposed method produces promising results when compared to the conventional oversampling strategies.
Publisher
Auricle Technologies, Pvt., Ltd.
Subject
Electrical and Electronic Engineering,Software,Information Systems,Human-Computer Interaction,Computer Networks and Communications
Cited by
1 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献
1. Classification of Ambient Noises in Signals Using 2D Fully Convolutional Neural Network;2023 International Conference on Data Science and Network Security (ICDSNS);2023-07-28