Information criterion-based clustering with order-restricted candidate profiles in short time-course microarray experiments-Reference-Cited by-同舟云学术

Information criterion-based clustering with order-restricted candidate profiles in short time-course microarray experiments

Published:2009-05-15 Issue:1 Volume:10 Page:
ISSN:1471-2105
Container-title:BMC Bioinformatics
language:en
Short-container-title:BMC Bioinformatics

Author:

Liu Tianqing,Lin Nan,Shi Ningzhong,Zhang Baoxue

Abstract

Abstract Background Time-course microarray experiments produce vector gene expression profiles across a series of time points. Clustering genes based on these profiles is important in discovering functional related and co-regulated genes. Early developed clustering algorithms do not take advantage of the ordering in a time-course study, explicit use of which should allow more sensitive detection of genes that display a consistent pattern over time. Peddada et al. [1] proposed a clustering algorithm that can incorporate the temporal ordering using order-restricted statistical inference. This algorithm is, however, very time-consuming and hence inapplicable to most microarray experiments that contain a large number of genes. Its computational burden also imposes difficulty to assess the clustering reliability, which is a very important measure when clustering noisy microarray data. Results We propose a computationally efficient information criterion-based clustering algorithm, called ORICC, that also takes account of the ordering in time-course microarray experiments by embedding the order-restricted inference into a model selection framework. Genes are assigned to the profile which they best match determined by a newly proposed information criterion for order-restricted inference. In addition, we also developed a bootstrap procedure to assess ORICC's clustering reliability for every gene. Simulation studies show that the ORICC method is robust, always gives better clustering accuracy than Peddada's method and saves hundreds of times computational time. Under some scenarios, its accuracy is also better than some other existing clustering methods for short time-course microarray data, such as STEM [2] and Wang et al. [3]. It is also computationally much faster than Wang et al. [3]. Conclusion Our ORICC algorithm, which takes advantage of the temporal ordering in time-course microarray experiments, provides good clustering accuracy and is meanwhile much faster than Peddada's method. Moreover, the clustering reliability for each gene can also be assessed, which is unavailable in Peddada's method. In a real data example, the ORICC algorithm identifies new and interesting genes that previous analyses failed to reveal.

Publisher

Springer Science and Business Media LLC

Subject

Applied Mathematics,Computer Science Applications,Molecular Biology,Biochemistry,Structural Biology

Link

https://link.springer.com/content/pdf/10.1186/1471-2105-10-146.pdf

Reference45 articles.

1. Peddada S, Lobenhofer E, Li L, Afshari C, Weinberg C, Umbach D: Gene Selection and clustering for time-course and dose-response microarray experiments using order-restricted inference. Bioinformatics 2003, 19: 834–841.

2. Ernst J, Nau GJ, Bar-Joseph Z: Clustering short time series gene expression data. Bioinformatics 2005, 21(Suppl 1):i159-i168.

3. Wang L, Ramoni M, Sebastiani P: Clustering short gene expression profiles. Lecture Notes in Computer Science 2006, 3909: 60–68.

4. Gasch A, Spellman P, Kao C, Carmel-Harel O, Eisen M, Storz G, Botstein D, Brown P: Genomic expression programs in the response of yeast cells to environmental changes. Mol Biol Cell 2000, 11: 4241–4257.

5. Guillemin K, Salama N, Tompkins L, Falkow S: Cag pathogenicity island-specific responses of gastric epithelial cells to Helicobacter pylori infection. Proc Natl Acad Sci USA 2002, 99: 15136–15141.

Cited by 19 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Multi-omics integration with weighted affinity and self-diffusion applied for cancer subtypes identification;Journal of Translational Medicine;2024-01-19

2. Dissecting cancer heterogeneity based on dimension reduction of transcriptomic profiles using extreme learning machines;PLOS ONE;2018-09-14

3. Gene clustering for time-series microarray with production outputs;Soft Computing;2016-08-11

4. Checking unimodality using isotonic regression: an application to breast cancer mortality rates;Stochastic Environmental Research and Risk Assessment;2015-06-26

5. Shape-Output Gene Clustering for Time Series Microarrays;Advances in Intelligent Systems and Computing;2015