A Distributed Algorithm for Fast Mining Frequent Patterns in Limited and Varying Network Bandwidth Environments

Published: 2019-05-06, Volume 9, Issue 9, Page 1859
ISSN: 2076-3417
Journal: Applied Sciences
Language: English

Author:

Lin Chun-Cheng, Li Wei-Ching, Chen Ju-Chin, Chung Wen-Yu, Chung Sheng-Hao, Lin Kawuu W.

Abstract

Data mining is a set of methods for extracting hidden information from data. It mainly includes frequent pattern mining, sequential pattern mining, classification, and clustering. Frequent pattern mining discovers correlations among sets of items in large databases. As data sizes grow rapidly, mining frequent patterns becomes slower. Numerous studies have therefore developed algorithms that run in distributed computing environments to accelerate the mining process. FLR-mining (Fast, Load balancing and Resource efficient mining algorithm) is one of the fastest of these methods and takes load balancing and resource usage into account. FLR-mining can automatically determine the appropriate number of computing nodes. However, FLR-mining and other existing methods assume that the network bandwidth is constant. In practical distributed and many-task computing systems, this assumption fails because many mining tasks run simultaneously and cause packet collisions. A method that accounts for varying network bandwidth is therefore necessary. In this study, we propose a method that can rapidly mine frequent patterns under varying network bandwidth. The proposed method can also determine the appropriate number of computing nodes to utilize computing resources efficiently and achieve load balancing. Empirical evaluation shows that the proposed method delivers excellent performance in terms of execution efficiency and load balancing.
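
To make the notion of a frequent pattern concrete, the following is a minimal single-machine sketch that counts how many transactions contain each itemset and keeps those meeting a minimum support. It is not FLR-mining or the method proposed in the paper; the function name `frequent_itemsets`, the parameters `min_support` and `max_size`, and the sample transactions are all assumptions made for illustration.

```python
from collections import Counter
from itertools import combinations


def frequent_itemsets(transactions, min_support, max_size=2):
    """Brute-force illustration (hypothetical helper, not the paper's algorithm).

    Counts every itemset of 1..max_size items and keeps those that occur
    in at least min_support transactions.
    """
    counts = Counter()
    for transaction in transactions:
        # Deduplicate and sort so each itemset has one canonical tuple form.
        items = sorted(set(transaction))
        for size in range(1, max_size + 1):
            for itemset in combinations(items, size):
                counts[itemset] += 1
    return {itemset: n for itemset, n in counts.items() if n >= min_support}


# Made-up sample data, for illustration only.
transactions = [
    ["bread", "milk"],
    ["bread", "butter", "milk"],
    ["bread", "milk"],
    ["butter"],
]

result = frequent_itemsets(transactions, min_support=2)
for itemset, support in sorted(result.items()):
    print(itemset, support)
```

Enumerating all itemsets this way grows combinatorially with transaction length, which is one reason the distributed approaches described in the abstract split the work across computing nodes.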

Funder

Ministry of Science and Technology, R.O.C.

Publisher

MDPI AG

Subject

Fluid Flow and Transfer Processes, Computer Science Applications, Process Chemistry and Technology, General Engineering, Instrumentation, General Materials Science

Link

https://www.mdpi.com/2076-3417/9/9/1859/pdf

References: 30 articles.

1. A survey of sequential pattern mining; Fournier-Viger; Data Sci. Pattern Recognit., 2017

2. HUOPM: High-Utility Occupancy Pattern Mining

3. Distributed data mining in grid computing environments

Cited by 2 articles.

1. Actionable Pattern-Driven Analytics and Prediction; Applied Sciences; 2021-08-17

2. Network Intrusion Detection with a Hashing Based Apriori Algorithm Using Hadoop MapReduce; Computers; 2019-12-02