Affiliation:
1. Department of Computer Science, College of Computer Science, National Chiao Tung University, Hsinchu City, Taiwan
Abstract
High utility sequential pattern (HUSP) mining is an emerging topic in pattern mining, and only a few algorithms have been proposed to address it. In practice, most sequence databases usually grow over time, and it is inefficient for existing algorithms to mine HUSPs from scratch when databases grow with a small portion of updates. In view of this, we propose the IncUSP-Miner
+
algorithm to mine HUSPs incrementally. Specifically, to avoid redundant re-computations, we propose a tighter upper bound of the utility of a sequence, called Tight Sequence Utility (TSU), and then we design a novel data structure, called the candidate pattern tree, to buffer the sequences whose TSU values are greater than or equal to the minimum utility threshold in the original database. Accordingly, to avoid keeping a huge amount of utility information for each sequence, a set of concise utility information is designed to be stored in each tree node. To improve the mining efficiency, several strategies are proposed to reduce the amount of computation for utility update and the scopes of database scans. Moreover, several strategies are also proposed to properly adjust the candidate pattern tree for the support of multiple database updates. Experimental results on some real and synthetic datasets show that IncUSP-Miner
+
is able to efficiently mine HUSPs incrementally.
Funder
Ministry of Science and Technology, Taiwan
Publisher
Association for Computing Machinery (ACM)
Subject
Artificial Intelligence,Theoretical Computer Science
Cited by
20 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献
1. Mining frequent temporal duration-based patterns on time interval sequential database;Information Sciences;2024-04
2. A survey of high utility sequential patterns mining methods;Journal of Intelligent & Fuzzy Systems;2023-11-04
3. Incremental Targeted Mining in Sequences;2023 IEEE 10th International Conference on Data Science and Advanced Analytics (DSAA);2023-10-09
4. HUSP-SP: Faster Utility Mining on Sequence Data;ACM Transactions on Knowledge Discovery from Data;2023-08-10
5. Performance Evaluation of Vedic Multiplier Using Hybrid Improvised High Utility Item Set Mining Using Fuzzy;2023 International Conference on Advances in Computing, Communication and Applied Informatics (ACCAI);2023-05-25