Breaking the hierarchy - a new cluster selection mechanism for hierarchical clustering methods-Reference-Cited by-同舟云学术

Breaking the hierarchy - a new cluster selection mechanism for hierarchical clustering methods

Published:2009-10-19 Issue:1 Volume:4 Page:
ISSN:1748-7188
Container-title:Algorithms for Molecular Biology
language:en
Short-container-title:Algorithms Mol Biol

Author:

Zahoránszky László A,Katona Gyula Y,Hári Péter,Málnási-Csizmadia András,Zweig Katharina A,Zahoránszky-Köhalmi Gergely

Abstract

Abstract Background Hierarchical clustering methods like Ward's method have been used since decades to understand biological and chemical data sets. In order to get a partition of the data set, it is necessary to choose an optimal level of the hierarchy by a so-called level selection algorithm. In 2005, a new kind of hierarchical clustering method was introduced by Palla et al. that differs in two ways from Ward's method: it can be used on data on which no full similarity matrix is defined and it can produce overlapping clusters, i.e., allow for multiple membership of items in clusters. These features are optimal for biological and chemical data sets but until now no level selection algorithm has been published for this method. Results In this article we provide a general selection scheme, the level independent clustering selection method, called LInCS. With it, clusters can be selected from any level in quadratic time with respect to the number of clusters. Since hierarchically clustered data is not necessarily associated with a similarity measure, the selection is based on a graph theoretic notion of cohesive clusters. We present results of our method on two data sets, a set of drug like molecules and set of protein-protein interaction (PPI) data. In both cases the method provides a clustering with very good sensitivity and specificity values according to a given reference clustering. Moreover, we can show for the PPI data set that our graph theoretic cohesiveness measure indeed chooses biologically homogeneous clusters and disregards inhomogeneous ones in most cases. We finally discuss how the method can be generalized to other hierarchical clustering methods to allow for a level independent cluster selection. Conclusion Using our new cluster selection method together with the method by Palla et al. provides a new interesting clustering mechanism that allows to compute overlapping clusters, which is especially valuable for biological and chemical data sets.

Publisher

Springer Science and Business Media LLC

Subject

Applied Mathematics,Computational Theory and Mathematics,Molecular Biology,Structural Biology

Link

https://link.springer.com/content/pdf/10.1186/1748-7188-4-12.pdf

Reference35 articles.

1. Downs GM, Willett P: Similarity searching and clustering of chemical-structure databases using molecular property data. J Chem Inf Comput Sci. 1994, 34: 1094-1102.

2. Willett P: Chemical similarity searching. J Chem Inf Comput Sci. 1998, 38: 983-996.

3. Wild DJ, Blankley CJ: Comparison of 2D fingerprint types and hierarchy level selection methods fo structural grouping using Ward's clustering. J Chem Inf Comput Sci. 2000, 40: 155-162.

4. Brown RD, Martin YC: Use of structure-activity data to compare structure-based clustering methods and descriptors for use in compound selection. J Chem Inf Comput Sci. 1996, 36: 572-584.

5. Ward JH: Hierarchical grouping to optimize an objective function. J Amer Statist Assoc. 1963, 58: 236-244. 10.2307/2282967.

Cited by 15 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Community Detection in Social Networks;Principles of Social Networking;2021-08-19

2. Modulation of Triple Artemisinin-Based Combination Therapy Pharmacodynamics by Plasmodium falciparum Genotype;ACS Pharmacology & Translational Science;2020-11-02

3. Modulation of triple artemisinin-based combination therapy pharmacodynamics by Plasmodium falciparum genotype;2020-07-03

4. SmartGraph: a network pharmacology investigation platform;Journal of Cheminformatics;2020-01-21

5. SmartGraph: A Network Pharmacology Investigation Platform;2019-07-19