Affiliation:
1. Rensselaer Polytechnic Institute, USA
2. Virginia Tech., USA
3. Microsoft Corp., USA
Abstract
Regulatory network analysis and other bioinformatics tasks require the ability to induce and represent arbitrary boolean expressions from data sources. In this paper, the authors introduce a novel framework called BLOSOM for mining (frequent) boolean expressions over binary-valued datasets. Boolean expressions can be grouped into four categories: pure conjunctions, pure disjunctions, conjunction of disjunctions, and disjunction of conjunctions. The authors’ main focus is on mining the simplest expressions (the minimal generators), but also to propose closure operators that yield closed (or unique maximal) boolean expressions. BLOSOM efficiently mines frequent boolean expressions by utilizing a number of methodical pruning techniques. Experiments showcase the behavior of BLOSOM for different input settings and parameter thresholds. Application studies on gene expression and gene regulation patterns showcase the effectiveness of this approach.
Reference30 articles.
1. Fast discovery of association rules;R.Agrawal;Advances in Knowledge Discovery and Data Mining,1996
2. Akutsu, T., Kuhara, S., Maruyama, O., & Miyano, S. (1998). Identification of gene regulatory networks by strategic gene disruptions and gene overexpressions. In Proceedings of the ACM-SIAM Symposium on Discrete Algorithms.
3. Antonie, M.-L., & Zaiane, O. (2004). Mining positive and negative association rules: An approach for confined rules. In Proceedings of the European PKDD Conf.
4. Mining frequent patterns with counting inference.;Y.Bastide;SIGKDD Explorations,2000
5. Bayardo, R. J., & Agrawal, R. (1999). Mining the most interesting rules. In Proceedings of the ACM SIGKDD Conf.
Cited by
4 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献