Judging the Quality of Gene Expression-Based Clustering Methods Using Gene Annotation

Published: 2002-10-01 Issue: 10 Volume: 12 Page: 1574-1581
ISSN: 1088-9051
Container-title: Genome Research
Language: en
Short-container-title: Genome Res.

Author:

Gibbons Francis D., Roth Frederick P.

Abstract

We compare several commonly used expression-based gene clustering algorithms using a figure of merit based on the mutual information between cluster membership and known gene attributes. By studying various publicly available expression data sets we conclude that enrichment of clusters for biological function is, in general, highest at rather low cluster numbers. As a measure of dissimilarity between the expression patterns of two genes, no method outperforms Euclidean distance for ratio-based measurements, or Pearson distance for non-ratio-based measurements at the optimal choice of cluster number. We show the self-organized-map approach to be best for both measurement types at higher numbers of clusters. Clusters of genes derived from single- and average-linkage hierarchical clustering tend to produce worse-than-random results. [The algorithm described is available at http://llama.med.harvard.edu, under Software.]
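
The abstract's central quantity, the mutual information between cluster membership and a known gene attribute, can be illustrated with a minimal sketch. This is not the authors' implementation: the function name `mutual_information`, the toy labels, and the choice of bits (log base 2) are assumptions made for illustration, and the abstract does not say how the paper turns mutual information into its final figure of merit.

```python
import math
from collections import Counter


def mutual_information(clusters, attribute):
    """Mutual information, in bits, between two equal-length label sequences.

    `clusters` holds one cluster label per gene; `attribute` holds one
    annotation value per gene (for example True/False for a functional
    category). Both names are hypothetical, chosen for this sketch.
    """
    if len(clusters) != len(attribute):
        raise ValueError("both sequences must have one entry per gene")
    n = len(clusters)
    if n == 0:
        return 0.0

    joint_counts = Counter(zip(clusters, attribute))
    cluster_counts = Counter(clusters)
    attribute_counts = Counter(attribute)

    mi = 0.0
    for (c, a), n_ca in joint_counts.items():
        # p(c, a) * log2( p(c, a) / (p(c) * p(a)) ), written with raw counts
        ratio = n_ca * n / (cluster_counts[c] * attribute_counts[a])
        mi += (n_ca / n) * math.log2(ratio)
    return mi


# Toy data (invented): four genes, two clusters, one binary attribute.
perfect = mutual_information([0, 0, 1, 1], [True, True, False, False])
unrelated = mutual_information([0, 0, 1, 1], [True, False, True, False])
```

In the toy data, the first call pairs each cluster with exactly one attribute value, so cluster membership fully determines the attribute; the second call spreads both attribute values evenly across both clusters, so membership carries no information about the attribute.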

Publisher

Cold Spring Harbor Laboratory

Subject

Genetics (clinical), Genetics

Cited by 221 articles.

1. In vitro toxicity assessment of haloacetamides via a toxicogenomics assay;Environmental Toxicology and Pharmacology;2023-01

2. Very high-resolution satellite image segmentation using variable-length multi-objective genetic clustering for multi-class change detection;Journal of King Saud University - Computer and Information Sciences;2022-11

3. A Novel Calibration Step in Gene Co-Expression Network Construction;Frontiers in Bioinformatics;2021-11-23

4. Differential and Common Signatures of miRNA Expression and Methylation in Childhood Central Nervous System Malignancies: An Experimental and Computational Approach;Cancers;2021-10-31

5. Bioinformatic Analysis of Temporal and Spatial Proteome Alternations During Infections;Frontiers in Genetics;2021-07-02