Reclassification as Supervised Clustering-Reference-Cited by-同舟云学术

Reclassification as Supervised Clustering

Published:2000-11-01 Issue:11 Volume:12 Page:2537-2546
ISSN:0899-7667
Container-title:Neural Computation
language:en
Short-container-title:Neural Computation

Author:

Sierra A.¹,Corbacho F.¹

Affiliation:

1. Escuela Técnica Superior de Informática, Universidad Autónoma de Madrid, 28049 Madrid, Spain

Abstract

In some branches of science, such as molecular biology, classes may be defined but not completely trusted. Sometimes posterior analysis proves them to be partially incorrect. Despite its relevance, this phenomenon has not received much attention within the neural computation community. We define reclassification as the task of redefining some given classes by maximum likelihood learning in a model that contains both supervised and unsupervised information. This approach leads to supervised clustering with an additional complexity penalizing term on the number of new classes. As a proof of concept, a simple reclassification algorithm is designed and applied to a data set of gene sequences. To test the performance of the algorithm, two of the original classes are merged. The algorithm is capable of unraveling the original three-class hidden structure, in contrast to the unsupervised version (K-means); moreover, it predicts the subdivision of one of the original classes into two different ones.

Publisher

MIT Press - Journals

Subject

Cognitive Neuroscience,Arts and Humanities (miscellaneous)

Link

https://www.mitpressjournals.org/doi/pdf/10.1162/089976600300014836

Reference7 articles.

1. Supervised adaptive clustering: A hybrid neural network clustering algorithm

2. Semi-Supervised Point Prototype Clustering

3. Reclassification of Actaea to include Cimicifuga and Souliea (Ranunculaceae): phytogeny inferred from morphology, nrDNA ITS, and cpDNA trn L‐F sequence variation

4. Counterpropagation networks

5. Connectionist learning procedures

Cited by 8 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. A robust approach based on Weibull distribution for clustering gene expression data;Algorithms for Molecular Biology;2011-05-31

2. References;Building Intelligent Interactive Tutors;2009

3. Emergent unsupervised clustering paradigms with potential application to bioinformatics;Frontiers in Bioscience;2008

4. MUTUAL INFORMATION CLUSTERING FOR EFFICIENT MINING OF FUZZY ASSOCIATION RULES WITH APPLICATION TO GENE EXPRESSION DATA ANALYSIS;International Journal on Artificial Intelligence Tools;2006-04

5. KERNEL-BASED SELF-ORGANIZED MAPS TRAINED WITH SUPERVISED BIAS FOR GENE EXPRESSION DATA ANALYSIS;Journal of Bioinformatics and Computational Biology;2004-01