Compositional mining of multirelational biological datasets-Reference-Cited by-同舟云学术

Compositional mining of multirelational biological datasets

Published:2008-03 Issue:1 Volume:2 Page:1-35
ISSN:1556-4681
Container-title:ACM Transactions on Knowledge Discovery from Data
language:en
Short-container-title:ACM Trans. Knowl. Discov. Data

Author:

Jin Ying¹,Murali T. M.¹,Ramakrishnan Naren¹

Affiliation:

1. Virginia Tech

Abstract

High-throughput biological screens are yielding ever-growing streams of information about multiple aspects of cellular activity. As more and more categories of datasets come online, there is a corresponding multitude of ways in which inferences can be chained across them, motivating the need for compositional data mining algorithms. In this article, we argue that such compositional data mining can be effectively realized by functionally cascading redescription mining and biclustering algorithms as primitives. Both these primitives mirror shifts of vocabulary that can be composed in arbitrary ways to create rich chains of inferences. Given a relational database and its schema, we show how the schema can be automatically compiled into a compositional data mining program, and how different domains in the schema can be related through logical sequences of biclustering and redescription invocations. This feature allows us to rapidly prototype new data mining applications, yielding greater understanding of scientific datasets. We describe two applications of compositional data mining: (i) matching terms across categories of the Gene Ontology and (ii) understanding the molecular mechanisms underlying stress response in human cells.

Publisher

Association for Computing Machinery (ACM)

Subject

General Computer Science

Link

https://dl.acm.org/doi/pdf/10.1145/1342320.1342322

Reference56 articles.

1. Mining Chains of Relations

2. Automatic Subspace Clustering of High Dimensional Data

3. Gene Ontology: tool for the unification of biology

4. BIND: the Biomolecular Interaction Network Database

Cited by 8 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Conceptual Coverage Driven by Essential Concepts: A Formal Concept Analysis Approach;Mathematics;2021-10-23

2. Interactive Discovery of Coordinated Relationship Chains with Maximum Entropy Models;ACM Transactions on Knowledge Discovery from Data;2018-02-28

3. What Is Redescription Mining;Redescription Mining;2017

4. The human is the loop: new directions for visual analytics;Journal of Intelligent Information Systems;2014-01-28

5. Mining Multiple Related Data Sources Using Object-Oriented Model;Transactions on Large-Scale Data- and Knowledge-Centered Systems XIII;2014