Affiliation:
1. Instituto de Computação, Universidade de Federal de Alagoas (IC-UFAL), Maceió, Brazil
Abstract
Fuzzy C-means (FCM) clustering algorithm is an important and popular clustering algorithm which is utilized in various application domains such as pattern recognition, machine learning, and data mining. Although this algorithm has shown acceptable performance in diverse problems, the current literature does not have studies about how they can improve the clustering quality of partitions with overlapping classes. The better the clustering quality of a partition, the better is the interpretation of the data, which is essential to understand real problems. This work proposes two robust FCM algorithms to prevent ambiguous membership into clusters. For this, we compute two types of weights: an weight to avoid the problem of overlapping clusters; and other weight to enable the algorithm to identify clusters of different shapes. We perform a study with synthetic datasets, where each one contains classes of different shapes and different degrees of overlapping. Moreover, the study considered real application datasets. Our results indicate such weights are effective to reduce the ambiguity of membership assignments thus generating a better data interpretation.
Publisher
World Scientific Pub Co Pte Ltd
Subject
Artificial Intelligence,Information Systems,Control and Systems Engineering,Software
Reference52 articles.
1. I. H. Witten, E. Frank and M. A. Hall, Practical machine learning tools and techniques, Morgan Kaufmann (2005) 578.
2. The Survey of Data Mining Applications and Feature Scope
Cited by
7 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献