Diluvian Clustering: A Fast, Effective Algorithm for Clustering Compositional and Other Data-Reference-Cited by-同舟云学术

Diluvian Clustering: A Fast, Effective Algorithm for Clustering Compositional and Other Data

Published:2015-08-24 Issue:5 Volume:21 Page:1173-1183
ISSN:1431-9276
Container-title:Microscopy and Microanalysis
language:en
Short-container-title:Microsc Microanal

Author:

Ritchie Nicholas W. M.

Abstract

AbstractDiluvian Clustering is an unsupervised grid-based clustering algorithm well suited to interpreting large sets of noisy compositional data. The algorithm is notable for its ability to identify clusters that are either compact or diffuse and clusters that have either a large number or a small number of members. Diluvian Clustering is fundamentally different from most algorithms previously applied to cluster compositional data in that its implementation does not depend upon a metric. The algorithm reduces in two-dimensions to a case for which there is an intuitive, real-world parallel. Furthermore, the algorithm has few tunable parameters and these parameters have intuitive interpretations. By eliminating the dependence on an explicit metric, it is possible to derive reasonable clusters with disparate variances like those in real-world compositional data sets. The algorithm is computationally efficient. While the worst case scales as O(N2) most cases are closer to O(N) where N is the number of discrete data points. On a mid-range 2014 vintage computer, a typical 20,000 particle, 30 element data set can be clustered in a fraction of a second.

Publisher

Cambridge University Press (CUP)

Subject

Instrumentation

Reference15 articles.

1. Support-vector networks

2. Data Classification

3. Data Clustering: Theory, Algorithms, and Applications

Cited by 2 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Big Data Analytics for Remote Sensing: Concepts and Standards;Springer Remote Sensing/Photogrammetry;2023

2. Reproducible Spectrum and Hyperspectrum Data Analysis Using NeXL;Microscopy and Microanalysis;2022-03-02