Iterative Optimization and Simplification of Hierarchical Clusterings-Reference-Cited by-同舟云学术

Iterative Optimization and Simplification of Hierarchical Clusterings

Published:1996-04-01 Issue: Volume:4 Page:147-178
ISSN:1076-9757
Container-title:Journal of Artificial Intelligence Research
language:
Short-container-title:jair

Author:

Fisher D.

Abstract

Clustering is often used for discovering structure in data. Clustering systems differ in the objective function used to evaluate clustering quality and the control strategy used to search the space of clusterings. Ideally, the search strategy should consistently construct clusterings of high quality, but be computationally inexpensive as well. In general, we cannot have it both ways, but we can partition the search so that a system inexpensively constructs a `tentative' clustering for initial examination, followed by iterative optimization, which continues to search in background for improved clusterings. Given this motivation, we evaluate an inexpensive strategy for creating initial clusterings, coupled with several control strategies for iterative optimization, each of which repeatedly modifies an initial clustering in search of a better one. One of these methods appears novel as an iterative optimization strategy in clustering contexts. Once a clustering has been constructed it is judged by analysts -- often according to task-specific criteria. Several authors have abstracted these criteria and posited a generic performance task akin to pattern completion, where the error rate over completed patterns is used to `externally' judge clustering utility. Given this performance task, we adapt resampling-based pruning strategies used by supervised learning systems to the task of simplifying hierarchical clusterings, thus promising to ease post-clustering analysis. Finally, we propose a number of objective functions, based on attribute-selection measures for decision-tree induction, that might perform well on the error rate and simplicity dimensions.

Publisher

AI Access Foundation

Subject

Artificial Intelligence

Cited by 105 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Cluster Analysis in R With Big Data Applications;Research Anthology on Bioinformatics, Genomics, and Computational Biology;2023-12-29

2. Performance Evaluation of Data Stream Clustering Algorithm on Parameter Specification;The 6th International Conference on Wireless, Intelligent and Distributed Environment for Communication;2023-12-21

3. Have You Ever Seen a Robot? An Analysis of Children’s Drawings Between Technology and Science Fiction;Journal for STEM Education Research;2023-05-01

4. Identifying key elements for adequate simplifications of investment choices – The case of wind energy expansion;Energy Economics;2023-04

5. Data stream clustering for low-cost machines;Journal of Parallel and Distributed Computing;2022-08