FPDclustering: a comprehensive R package for probabilistic distance clustering based methods-Reference-Cited by-同舟云学术

FPDclustering: a comprehensive R package for probabilistic distance clustering based methods

Published:2024-05-15 Issue: Volume: Page:
ISSN:0943-4062
Container-title:Computational Statistics
language:en
Short-container-title:Comput Stat

Author:

Tortora Cristina^ORCID,Palumbo Francesco

Abstract

AbstractData clustering has a long history and refers to a vast range of models and methods that exploit the ever-more-performing numerical optimization algorithms and are designed to find homogeneous groups of observations in data. In this framework, the probability distance clustering (PDC) family methods offer a numerically effective alternative to model-based clustering methods and a more flexible opportunity in the framework of geometric data clustering. Given nJ-dimensional data vectors arranged in a data matrix and the number K of clusters, PDC maximizes the joint density function that is defined as the sum of the products between the distance and the probability, both of which are measured for each data vector from each center. This article shows the capabilities of the PDC family, illustrating the package .

Funder

National Science Foundation

Publisher

Springer Science and Business Media LLC

Link

https://link.springer.com/content/pdf/10.1007/s00180-024-01490-5.pdf

Reference46 articles.

1. Aggarwal CC (2014) Data classification. Algorithms and applications. CRC Press Taylor and Francis Group, Boca Raton

2. Ahmad A, Khan SS (2019) Survey of state-of-the-art mixed data clustering algorithms. IEEE Access 7:31883–31902

3. Alivernini F, Lucidi F (2008) The Academic Motivation Scale (AMS): factorial structure, invariance and validity in the Italian context. Test Psychometr Methodol Appl Psychol 15(4):211–220

4. Ben-Israel A, Iyigun C (2008) Probabilistic d-clustering. J Classif 25(1):5–26

5. Bezdek JC (2013) Pattern recognition with fuzzy objective function algorithms. Springer, Berlin