Affiliation:
1. University of Ottawa, Ottawa, Ontario, Canada
Abstract
It is often assumed that class imbalances are responsible for significant losses of performance in standard classifiers. The purpose of this paper is to the question whether class imbalances are truly responsible for this degradation or whether it can be explained in some other way. Our experiments suggest that the problem is not directly caused by class imbalances, but rather, that class imbalances may yield small disjuncts which, in turn, will cause degradation. We argue that, in order to improve classifier performance, it may, then, be more useful to focus on the small disjuncts problem than it is to focus on the class imbalance problem. We experiment with a method that takes the small disjunct problem into consideration, and show that, indeed, it yields a performance superior to the performance obtained using standard or advanced solutions to the class imbalance problem.
Publisher
Association for Computing Machinery (ACM)
Reference14 articles.
1. P. M. Murphy and D. W. Aha. UCI Repository of Machine Learning Databases. University California at Irvine Department of Information and Computer Science.]] P. M. Murphy and D. W. Aha. UCI Repository of Machine Learning Databases. University California at Irvine Department of Information and Computer Science.]]
Cited by
405 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献