Affiliation:
1. INESC-ID and Instituto Superior Técnico, Universidade de Lisboa, Portugal
2. LASIGE, Faculdade de Ciências, Universidade de Lisboa, Portugal
Abstract
Three-dimensional data are increasingly prevalent across biomedical and social domains. Notable examples are gene-sample-time, individual-feature-time, and node-node-time data, generally referred to as observation-attribute-context data. The unsupervised analysis of three-dimensional data can be pursued to discover putative biological modules, disease progression profiles, and communities of individuals with coherent behavior, among other patterns of interest. Such analysis is thus key to enhancing the understanding of complex biological, individual, and societal systems. In this context, although clustering can be applied to group observations, its relevance is limited, since observations in three-dimensional data domains are typically only meaningfully correlated on subspaces of the overall space. Biclustering tackles this challenge but disregards the third dimension. Triclustering, the discovery of coherent subspaces within three-dimensional data, has been widely researched to address both limitations. Despite the diversity of contributions in this field, a structured view of the major requirements of triclustering, desirable forms of homogeneity (including coherency, structure, quality, locality, and orthonormality criteria), and algorithmic approaches is still lacking. This work formalizes the triclustering task and its scope, introduces a taxonomy to categorize the contributions in the field, provides a comprehensive comparison of state-of-the-art triclustering algorithms according to their behavior and output, and lists relevant real-world applications. Finally, it highlights challenges and opportunities to advance the field of triclustering and its applicability to complex three-dimensional data analysis.
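To make the notion of a coherent subspace concrete, the following is a minimal sketch and is not taken from the survey. It assumes the simplest coherency criterion, a constant-valued tricluster, and plants one in a small random observation-attribute-context tensor. All names (`make_tensor`, `plant_constant_tricluster`, `subspace_values`) and all sizes and values are hypothetical, chosen only for illustration.

```python
import random
from statistics import pvariance


def make_tensor(n_obs, n_attr, n_ctx, seed=0):
    """Build a random observation x attribute x context tensor (nested lists)."""
    rng = random.Random(seed)
    return [[[rng.uniform(0.0, 10.0) for _ in range(n_ctx)]
             for _ in range(n_attr)]
            for _ in range(n_obs)]


def plant_constant_tricluster(tensor, obs, attrs, ctxs, value):
    """Overwrite the cells indexed by (obs, attrs, ctxs) with one constant value."""
    for i in obs:
        for j in attrs:
            for k in ctxs:
                tensor[i][j][k] = value


def subspace_values(tensor, obs, attrs, ctxs):
    """Collect the values of the subspace defined by the three index subsets."""
    return [tensor[i][j][k] for i in obs for j in attrs for k in ctxs]


# Hypothetical toy data: 6 observations, 5 attributes, 4 contexts.
tensor = make_tensor(6, 5, 4)

# A tricluster is a subset of observations, attributes, and contexts;
# the subsets need not be contiguous along any dimension.
obs, attrs, ctxs = [1, 3, 4], [0, 2], [1, 2, 3]
plant_constant_tricluster(tensor, obs, attrs, ctxs, value=7.0)

inside = subspace_values(tensor, obs, attrs, ctxs)
everything = subspace_values(tensor, range(6), range(5), range(4))

# Values are homogeneous inside the subspace but not across the full space.
variance_inside = pvariance(inside)
variance_overall = pvariance(everything)
print(variance_inside, variance_overall)
```

In this sketch the planted subspace has zero variance while the full tensor does not, which mirrors the point that observations may be coherent only on a subset of attributes and contexts. Real triclustering algorithms must search for such index subsets rather than receive them as input, and typically tolerate noise and richer coherency patterns than a constant value.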
Funder
INESC-ID
LASIGE
Fundação para a Ciência e Tecnologia
Publisher
Association for Computing Machinery (ACM)
Subject
General Computer Science, Theoretical Computer Science
References: 119 articles.
Cited by
47 articles.