MISFP-Growth: Hadoop-Based Frequent Pattern Mining with Multiple Item Support

Published: 2019-05-20 Issue: 10 Volume: 9 Page: 2075
ISSN: 2076-3417
Container-title: Applied Sciences
Language: en

Author:

Wang Chen-Shu, Chang Jui-Yen

Abstract

In practice, a single item support cannot comprehensively address the complexity of items in large datasets. In this study, we propose a big data analytics framework, the Multiple Item Support Frequent Patterns (MISFP-growth) algorithm, that uses Hadoop-based parallel computing to mine itemsets with multiple item supports (MIS) efficiently. The proposed architecture consists of two phases. First, in the support-counting phase, a Hadoop MapReduce architecture determines the support of each item. Next, in the analytics phase, sub-transaction blocks are generated according to MIS, and the MISFP-growth algorithm identifies the frequency of patterns. To help decision makers set MIS, we also propose the concept of classification of item (COI), which groups items of higher homogeneity into the same class so that each item inherits its class support as its item support. Three experiments were conducted to validate the proposed Hadoop-based MISFP-growth algorithm. The experimental results show an approximately 38% reduction in execution time on parallel architectures and confirm that the algorithm can be implemented on a distributed computing framework. This performance gain indicates that the algorithm could be applied to big data analytics.
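The support-counting phase and the COI idea described in the abstract can be illustrated with a minimal Python sketch. This is not the authors' Hadoop implementation: the map and reduce steps are plain in-memory functions standing in for MapReduce jobs, and all names (`map_phase`, `reduce_phase`, `item_support`, `inherit_mis`, `frequent_items`), the sample transactions, and the class thresholds are hypothetical, chosen only for illustration.

```python
from collections import Counter
from itertools import chain


def map_phase(transaction):
    # Emit one (item, 1) pair per distinct item in a single transaction.
    return [(item, 1) for item in set(transaction)]


def reduce_phase(pairs):
    # Sum the emitted counts per item, as a reducer would.
    counts = Counter()
    for item, n in pairs:
        counts[item] += n
    return counts


def item_support(transactions):
    # Support of an item = fraction of transactions that contain it.
    pairs = chain.from_iterable(map_phase(t) for t in transactions)
    counts = reduce_phase(pairs)
    total = len(transactions)
    return {item: count / total for item, count in counts.items()}


def inherit_mis(item_class, class_support):
    # COI: each item inherits the minimum support of the class it belongs to.
    return {item: class_support[cls] for item, cls in item_class.items()}


def frequent_items(support, mis):
    # An item is kept when its support reaches its own (inherited) threshold.
    return sorted(item for item, s in support.items() if s >= mis[item])


# Hypothetical data: two item classes with different class supports.
transactions = [
    ["milk", "bread"],
    ["milk", "salt"],
    ["bread", "caviar"],
    ["milk", "bread"],
]
item_class = {"milk": "staple", "bread": "staple",
              "salt": "staple", "caviar": "luxury"}
class_support = {"staple": 0.5, "luxury": 0.25}

support = item_support(transactions)
mis = inherit_mis(item_class, class_support)
print(frequent_items(support, mis))  # prints ['bread', 'caviar', 'milk']
```

In this toy data, salt and caviar have the same support (0.25), but only caviar is retained because its class carries a lower threshold. That is the kind of distinction a single global support value cannot express.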

Publisher

MDPI AG

Subject

Fluid Flow and Transfer Processes, Computer Science Applications, Process Chemistry and Technology, General Engineering, Instrumentation, General Materials Science

Link

https://www.mdpi.com/2076-3417/9/10/2075/pdf

Reference: 31 articles.

1. Storage Challenge: Where Will All That Big Data Go?

2. How organisations leverage Big Data: a maturity model

3. Data mining with big data; Wu; IEEE Trans. Knowl. Data Eng., 2014

4. A survey of sequential pattern mining; Fournier-Viger; Data Science and Pattern Recognition, 2017

5. Recent Development in Big Data Analytics for Business Operations and Risk Management

Cited by 11 articles.

1. High Dimensional Data Differential Privacy Protection Publishing Method Based on Association Analysis;Electronics;2023-06-23

2. Mining Interesting Negative Sequential Patterns Based on Influence;IEEE Access;2023

3. Parallel Frequent Subtrees Mining Method by an Effective Edge Division Strategy;Applied Sciences;2022-05-09

4. Application of Hadoop-Based Cloud Computing in Teaching Platform Research;Journal of Interconnection Networks;2022-01-28

5. Actionable Pattern-Driven Analytics and Prediction;Applied Sciences;2021-08-17