Affiliation:
1. Department of Computer Science and Engineering, North Minzu University, NingXia, China
Abstract
High utility itemset mining (HUIM) with negative utility is an emerging data mining task. However, the setting of the minimum utility threshold is always a challenge when mining high utility itemsets (HUIs) with negative items. Although the top-k HUIM method is very common, this method can only mine itemsets with positive items, and the problem of missing itemsets occurs when mining itemsets with negative items. To solve this problem, we first propose an effective algorithm called THN (Top-k High Utility Itemset Mining with Negative Utility). It proposes a strategy for automatically increasing the minimum utility threshold. In order to solve the problem of multiple scans of the database, it uses transaction merging and dataset projection technology. It uses a redefined sub-tree utility value and a redefined local utility value to prune the search space. Experimental results on real datasets show that THN is efficient in terms of runtime and memory usage, and has excellent scalability. Moreover, experiments show that THN performs particularly well on dense datasets.
Subject
Artificial Intelligence,General Engineering,Statistics and Probability
Reference24 articles.
1. Agrawal R. and Srikant R. , Fast algorithms for mining association rules, In Proc. of the 20th Int’l Conf. on Very Large Data Bases (1994), 487–499.
2. A two-phase algorithm for fast discovery of high utility itemset;Liu;Advances in Knowledge Discovery and Data Mining, the 9th Pacific-Asia Conference. PAKDD
3. Efficient Tree Structures for High utility Pattern Mining in Incremental Databases;Ahmed;IEEE Transactions on Knowledge and Data Engineering,2009
4. Isolated Items Discarding Strategy for Discovering High utility Itemsets;Li;&Knowledge Engineering,2008
5. Efficient algorithms for mining high utility itemsets from transactional databases;Tseng;IEEE Transactions on Knowledge and Data Engineering,2013
Cited by
12 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献