Affiliation:
1. School of Management Science, Qufu Normal University, Rizhao 276800, China
Abstract
Bicluster mining has been frequently studied in the data mining field. Because column constant biclusters (CCB) can be transformed to be discriminative rules, they have been widely applied in various fields. However, no research on incrementally mining CCB has been reported in the literature. In real situations, due to the limitation of computation resources (such as memory), it is impossible to mine biclusters from very large datasets. Therefore, in this study, we propose an incremental mining CCB method. CCB can be deemed as a special case of frequent pattern (FP). Currently the most frequently used method for incrementally mining frequent patterns is FP tree based method. In this study, we innovatively propose an incremental mining CCB method with modified FP tree data structure. The technical contributions lie in two aspects. The first aspect is that we propose a modified FP tree data structure, namely Feature Value Sorting Frequent Pattern (FVSFP) tree that can be easily maintained. The second aspect is that we innovatively design a method for mining CCB from FVSFP tree. To verify the performance of the proposed method, it is tested on several datasets. Experimental results demonstrated that the proposed method has good performance for incrementally handling a newly added dataset.
Subject
Fluid Flow and Transfer Processes,Computer Science Applications,Process Chemistry and Technology,General Engineering,Instrumentation,General Materials Science
Reference35 articles.
1. Samir, R., El-Hennawy, H., and Elbadawy, H. (2023). Cluster-Based Multi-User Multi-Server Caching Mechanism in Beyond 5G/6G MEC. Sensors, 23.
2. Community Detection and Visualization in Complex Network by the Density-Canopy-Kmeans Algorithm and MDS Embedding;Li;IEEE Access,2019
3. Parallelized Evolutionary Learning for Detection of Biclusters in Gene Expression Data;Huang;IEEE/ACM Trans. Comput. Biol. Bioinform.,2012
4. Cheng, Y., and Church, G.M. (2000, January 19–23). Biclustering of expression data. Proceedings of the Eighth International Conference on Intelligent Systems for Molecular Biology, San Diego, CA, USA.
5. Cheng, H. (2008). Towards Accurate and Efficient Classification: A Discriminative and Frequent Pattern-Based Approach, University of Illinois. Technical Report.