Affiliation:
1. Department of Information Engineering and Computer Science, Feng Chia University, Taiwan
2. Department of Information Management, Chaoyang University of Technology, Taiwan
Abstract
Many modern applications such as sensor networks produce probabilistic data. These data are collected into an uncertain database. To interpret uncertainty and to mine frequent patterns in an uncertain database, all possible certain databases are considered, which generates an exponential number of combinations and makes the mining problem highly complicated. In practice, mining is interactive, which makes the discovery of frequent itemsets in an uncertain database even more challenging. The objective of interactive mining is to shorten the time that is required to obtain the desired patterns in the iterated lengthy mining process. The time-consuming mining process in an uncertain database is exacerbated by repeated processing if the mining is performed from scratch. Therefore, we propose an interactive mining algorithm called iDIP to solve this problem. The iDIP algorithm adopts an approximation mechanism to mine the patterns and prunes candidates by using the existing patterns. Comprehensive experiments using both real and synthetic datasets show that iDIP outperforms the well-known re-mining-based MB algorithm for 4.3 times faster on average. In addition, iDIP has good linear scalability.
Funder
Ministry of Science and Technology, Taiwan
Publisher
World Scientific Pub Co Pte Ltd
Subject
Artificial Intelligence,Information Systems,Control and Systems Engineering,Software
Cited by
3 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献