Abstract
High utility sequential pattern (HUSP) mining aims to mine actionable patterns with high utilities, widely applied in real-world learning scenarios such as market basket analysis, scenic route planning and click-stream analysis. The existing HUSP mining algorithms mainly attempt to improve computation efficiency while maintaining the algorithm stability in the setting of large-scale data. Although these methods have made some progress, they ignore the relationship between additional items and underlying sequences, which directly leads to the generation of redundant sequential patterns sharing the same underlying sequence. Hence, the mined patterns’ actionability is limited, which significantly compromises the performance of patterns in real-world applications. To address this problem, we present a new method named Combined Utility-Association Sequential Pattern Mining (CUASPM) by incorporating item/sequence relations, which can effectively remove redundant patterns and extract high discriminative and strongly associated sequential pattern combinations with high utilities. Specifically, we introduce the concept of actionable combined mining into HUSP mining for the first time and develop a novel tree structure to select discriminative high utility sequential patterns (HUSPs) for downstream tasks. Furthermore, two efficient strategies (i.e., global and local strategies) are presented to facilitate mining HUSPs while guaranteeing utility growth and high levels of association. Last, two parameters are introduced to evaluate the interestingness of patterns to choose the most useful actionable combined HUSPs (ACHUSPs). Extensive experimental results demonstrate that the proposed CUASPM outperforms the baselines in terms of execution time, memory usage, mining high discriminative and strongly associated HUSPs.
Funder
National Natural Science Foundation of China
Natural Science Foundation of Shandong Province
Natural Science Foundation of Shandong province
Shandong Excellent Young Scientists Fund
Publisher
Public Library of Science (PLoS)
Reference44 articles.
1. Mining High Utility Sequential Patterns Using Multiple Minimum Utility;T Xu;Int J Pattern Recogn,2018
2. Mining high utility patterns in interval-based event sequences;SM Mirbagheri;Data Knowl Eng,2021
3. ProUM: Projection-based utility mining on sequence data;W Gan;Inform Sci,2020
4. Fast Utility Mining on Sequence Data;W Gan;IEEE Trans Cybern,2021
5. TKUS: Mining top-k high utility sequential patterns;C Zhang;Inform Sci,2021