Adversarial Approaches to Tackle Imbalanced Data in Machine Learning

Published: 2023-04-24
Volume: 15
Issue: 9
Page: 7097
ISSN: 2071-1050
Container-title: Sustainability
Language: en

Author:

Ayoub Shahnawaz¹, Gulzar Yonis², Rustamov Jaloliddin³, Jabbari Abdoh⁴, Reegu Faheem Ahmad⁴, Turaev Sherzod⁵

Affiliation:

1. Department of Computer Science and Engineering, Shri Venkateshwara University, NH-24, Venkateshwara Nagar, Gajraula 244236, Uttar Pradesh, India

2. Department of Management Information Systems, College of Business Administration, King Faisal University, Al-Ahsa 31982, Saudi Arabia

3. Health Data Science Lab, Department of Genetics and Genomics, College of Medicine and Health Sciences, United Arab Emirates University, Al Ain 15551, United Arab Emirates

4. Department of Computer Science and Information Technology, Jazan University, Jazan 45142, Saudi Arabia

5. Department of Computer Science & Software Engineering, College of Information Technology, United Arab Emirates University, Al Ain 15551, United Arab Emirates

Abstract

Real-world applications often involve imbalanced datasets, in which examples are unevenly distributed across classes. When building a system that requires high accuracy, classifier performance is crucial. However, imbalanced datasets can lead to poor classification performance, even with conventional techniques such as the synthetic minority oversampling technique (SMOTE). This study therefore proposes balancing the datasets using adversarial learning methods such as generative adversarial networks (GANs), and evaluates the effect of data augmentation on both balanced and imbalanced datasets. Classification performance was evaluated on three datasets, with data augmentation used to generate synthetic data for the minority class. Before augmentation, a decision tree applied to the three datasets achieved classification accuracies of 79.9%, 94.1%, and 72.6%. After augmentation, a decision tree was again used for evaluation, and the proposed model achieved accuracies of 82.7%, 95.7%, and 76% on highly imbalanced data. This study demonstrates the potential of data augmentation to improve classification performance on imbalanced datasets.
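
To make the size of the reported improvement concrete, the following is a small arithmetic sketch, not part of the paper. It assumes the two sets of accuracies in the abstract are listed in the same dataset order; the abstract does not name the datasets, so they are simply numbered here, and the function name `accuracy_gains` is hypothetical.

```python
# Decision-tree accuracies (%) as reported in the abstract.
# Assumption: both lists follow the same dataset order.
baseline = [79.9, 94.1, 72.6]    # before augmentation
augmented = [82.7, 95.7, 76.0]   # after augmentation


def accuracy_gains(before, after):
    """Return (gain in percentage points, relative gain in %) per dataset."""
    gains = []
    for b, a in zip(before, after):
        absolute = round(a - b, 1)              # percentage points
        relative = round(100 * (a - b) / b, 1)  # relative to the baseline
        gains.append((absolute, relative))
    return gains


gains = accuracy_gains(baseline, augmented)
for i, (abs_pp, rel_pct) in enumerate(gains, start=1):
    print(f"Dataset {i}: +{abs_pp} percentage points ({rel_pct}% relative)")
```

Under that ordering assumption, the gains are 2.8, 1.6, and 3.4 percentage points respectively, so the largest improvement falls on the dataset with the lowest baseline accuracy.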

Funder

United Arab Emirates University

Publisher

MDPI AG

Subject

Management, Monitoring, Policy and Law; Renewable Energy, Sustainability and the Environment; Geography, Planning and Development; Building and Construction

Link

https://www.mdpi.com/2071-1050/15/9/7097/pdf

Cited by 11 articles.

1. Grid-Based Structural and Dimensional Skin Cancer Classification with Self-Featured Optimized Explainable Deep Convolutional Neural Networks; International Journal of Molecular Sciences; 2024-01-26

2. Enhanced corn seed disease classification: leveraging MobileNetV2 with feature augmentation and transfer learning; Frontiers in Applied Mathematics and Statistics; 2024-01-03

3. Adaptability of deep learning: datasets and strategies in fruit classification; BIO Web of Conferences; 2024

4. Exploring Transfer Learning for Enhanced Seed Classification: Pre-trained Xception Model; Lecture Notes in Civil Engineering; 2024

5. Least square-support vector machine based brain tumor classification system with multi model texture features; Frontiers in Applied Mathematics and Statistics; 2023-12-06