High Utility Mining of Streaming Itemsets in Data Streams

Author:

Bokir Abdullah,Narasimha V B

Abstract

Abstract The traditional models for mining frequent itemsets mainly focus on the frequency of the items listed in the respective dataset. However, market basket analysis and other domains generally prefer utility obtained from items regardless of their frequencies in the transactions. One of the main options of utility in these domains could be profit. Therefore, it is significant to extract items that generate more profit than items that occurs more frequently in the dataset. Thus, mining high utility itemset has emerged recently as a prominent research topic in the field of data mining. Many of the existing researches have been proposed for mining high utility itemset from static data. However, with the recent advanced technologies, streaming data has become a good source for data in many applications. Mining high utility itemset over data streams is a more challenging task because of the uncertainty in data streams, processing time, and many more. Although some works have been proposed for mining high utility itemset over data streams, many of these works require multiple database scans and they require long processing time. In respect to this, we proposed a single-pass fast-search model in which we introduced a utility factor known as utility stream level for tracing the utility value of itemsets from data streams. The simulation study shows that the performance of the proposed model is more significant compared with the contemporary method. The comparison has been performed based on metrics like process-completion time and utilized search space.

Publisher

IOP Publishing

Subject

General Physics and Astronomy

Reference35 articles.

1. Mining weighted erasable patterns by using underestimated constraint-based pruning technique;A Lee;Journal of Intelligent & Fuzzy Systems,2015

2. Top-k high utility pattern mining with effective threshold raising strategies;Ryang;Knowledge-Based Systems

3. The smallest valid extension-based efficient, rare graph pattern mining, considering length-decreasing support constraints and symmetry characteristics of graphs;Yun;Symmetry,2016

4. CCSpan: Mining closed contiguous sequential patterns;Zhang;Knowledge-Based Systems

5. Approximate maximal frequent pattern mining with weight conditions and error tolerance;Lee;International Journal of Pattern Recognition and Artificial Intelligence,2016

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3