Affiliation:
1. School of Electrical Engineering and Information Engineering Lanzhou University of Technology Lanzhou China
2. Gansu Key Laboratory of Advanced Control of Industrial Processes Lanzhou University of Technology Lanzhou China
3. National Electrical and Control Engineering Experimental Teaching Center Lanzhou University of Technology Lanzhou China
Abstract
AbstractTo address the problems of pattern collapse, uncontrollable data generation and high overlap rate when generative adversarial network (GAN) oversamples imbalanced data, we propose an imbalanced data oversampling algorithm based on improved dual discriminator generative adversarial nets (D2GAN). First, we integrate the positive class attribute information into the generator and the discriminator to ensure that the generator only generates the samples for positive class samples, which overcomes the problem of uncontrollable data generation by the generator. Second, we introduce a classifier into D2GAN for discriminating the generated samples and the original data, which avoids the overlap among the generated samples and the negative class samples, and ensures the diversity of the generated samples, the problem of pattern collapse is solved. Finally, the performance of the proposed algorithm is evaluated on 9 datasets by using SVM and neural network classification algorithm for oversampling experiments, the results show that the proposed algorithm effectively improve the classification performance of imbalanced data.
Funder
Science and Technology Program of Gansu Province
Subject
Computer Science Applications,Information Systems,Analysis