A New Clustering Method Based on the Inversion Formula

Published: 2022-07-22 Issue: 15 Volume: 10 Page: 2559
ISSN: 2227-7390
Container-title: Mathematics
Language: en
Short-container-title: Mathematics

Author:

Lukauskas Mantas, Ruzgas Tomas

Abstract

Data clustering is an area of data mining that belongs to the class of unsupervised learning. Cluster analysis divides data into different classes by discovering the internal structure of the objects in a data set and the relationships among them. This paper presents a new density clustering method based on modified inversion formula density estimation. The new method should improve on the performance and robustness of k-means, the Gaussian mixture model, and other methods. The proposed clustering algorithm consists of three main steps. First, we initialize the parameters and generate a T matrix. Second, we estimate the densities of each point and cluster. Third, we update the mean, sigma, and phi matrices. The new method based on the inversion formula works quite well on different datasets compared with k-means, the Gaussian mixture model, and the Bayesian Gaussian mixture model. On the other hand, the method has a limitation: in its current state it cannot work with higher-dimensional data (d > 15). This will be addressed in future versions of the model, as detailed in future work. Additionally, the results show that the MIDEv2 method works best on generated data with outliers in all datasets (0.5%, 1%, 2%, 4% outliers). Interestingly, the new method based on the inversion formula can also cluster data that do not contain outliers; one of the most popular examples is the Iris dataset.
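
The three steps named in the abstract can be sketched as an EM-style loop. The sketch below is not the authors' MIDE implementation: the abstract does not define the modified inversion formula estimator, so a diagonal Gaussian density stands in for it, which makes this ordinary EM for a Gaussian mixture. The function name `soft_cluster`, the reading of T as an n-by-k soft membership matrix, and the reading of phi as cluster weights are all assumptions made for illustration.

```python
import numpy as np


def soft_cluster(X, k, n_iter=100, seed=0):
    """EM-style soft clustering (hypothetical helper, not the paper's code)."""
    rng = np.random.default_rng(seed)
    n, d = X.shape

    # Step 1: initialise parameters and generate the T matrix.
    # Assumption: T holds soft memberships, one row per point, rows sum to 1.
    mean = X[rng.choice(n, size=k, replace=False)].astype(float)
    sigma = np.tile(X.std(axis=0) + 1e-6, (k, 1))
    phi = np.full(k, 1.0 / k)
    T = np.full((n, k), 1.0 / k)

    for _ in range(n_iter):
        # Step 2: estimate the density of each point under each cluster.
        # Assumption: a diagonal Gaussian replaces the inversion formula
        # density estimate, which the abstract does not specify.
        dens = np.empty((n, k))
        for j in range(k):
            z = (((X - mean[j]) / sigma[j]) ** 2).sum(axis=1)
            norm = np.prod(np.sqrt(2.0 * np.pi) * sigma[j])
            dens[:, j] = phi[j] * np.exp(-0.5 * z) / norm
        T = dens / (dens.sum(axis=1, keepdims=True) + 1e-300)

        # Step 3: update the mean, sigma, and phi matrices from T.
        weight = T.sum(axis=0) + 1e-12
        phi = weight / weight.sum()
        mean = (T.T @ X) / weight[:, None]
        for j in range(k):
            var = (T[:, j, None] * (X - mean[j]) ** 2).sum(axis=0) / weight[j]
            sigma[j] = np.sqrt(var) + 1e-6

    return T, mean, sigma, phi
```

Calling `T.argmax(axis=1)` on the returned matrix gives a hard cluster label per point. Swapping the Gaussian in step 2 for another density estimator leaves the rest of the loop unchanged, which is the only structural point this sketch is meant to show.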

Publisher

MDPI AG

Subject

General Mathematics,Engineering (miscellaneous),Computer Science (miscellaneous)

Link

https://www.mdpi.com/2227-7390/10/15/2559/pdf

References: 45 articles.

1. A semi-supervised approximate spectral clustering algorithm based on HMRF model

2. View-Based 3-D Model Retrieval: A Benchmark

3. Modeling Temporal Information of Mitotic for Mitotic Event Detection

4. Deep learning-based clustering approaches for bioinformatics

Cited by 8 articles.

1. New clusterization of global seaport countries based on their DEA and FDEA network efficiency scores; PLOS ONE; 2024-07-30

2. Research on Resident Behavioral Activities Based on Social Media Data: A Case Study of Four Typical Communities in Beijing; Information; 2024-07-05

3. Specification Mining Based on the Ordering Points to Identify the Clustering Structure Clustering Algorithm and Model Checking; Algorithms; 2024-01-10

4. Enhancing Skills Demand Understanding through Job Ad Segmentation Using NLP and Clustering Techniques; Applied Sciences; 2023-05-16

5. Effective Incomplete Multi-View Clustering via Low-Rank Graph Tensor Completion; Mathematics; 2023-01-28