Abstract
Frequent sequence pattern mining is an excellent tool to discover patterns in event chains. In complex systems, events from parallel processes are present, often without proper labelling. To identify the groups of events related to the subprocess, frequent sequential pattern mining can be applied. Since most algorithms provide too many frequent sequences that make it difficult to interpret the results, it is necessary to post-process the resulting frequent patterns. The available visualisation techniques do not allow easy access to multiple properties that support a faster and better understanding of the event scenarios. To answer this issue, our work proposes an intuitive and interactive solution to support this task, introducing three novel network-based sequence visualisation methods that can reduce the time of information processing from a cognitive perspective. The proposed visualisation methods offer a more information rich and easily understandable interpretation of sequential pattern mining results compared to the usual text-like outcome of pattern mining algorithms. The first uses the confidence values of the transitions to create a weighted network, while the second enriches the adjacency matrix based on the confidence values with similarities of the transitive nodes. The enriched matrix enables a similarity-based Multidimensional Scaling (MDS) projection of the sequences. The third method uses similarity measurement based on the overlap of the occurrences of the supporting events of the sequences. The applicability of the method is presented in an industrial alarm management problem and in the analysis of clickstreams of a website. The method was fully implemented in Python environment. The results show that the proposed methods are highly applicable for the interactive processing of frequent sequences, supporting the exploration of the inner mechanisms of complex systems.
Funder
Kulturális és Innovációs Minisztérium Nemzeti Kutatási Fejlesztési és Innovációs Alap
Publisher
Public Library of Science (PLoS)
Reference34 articles.
1. A study of sequential pattern mining algorithms for use in detection of user activity patterns;M Dunaev;Journal of Theoretical and Applied Information Technology,2018
2. A sequential pattern mining model for application workload prediction in cloud environment;Maryam Amiri;Journal of Network and Computer Applications. Elsevier,2018
3. MalSPM: Metamorphic malware behavior analysis and classification using sequential pattern mining;M Saqib Nawaz;Computers & Security. Elsevier,2022
4. Abonyi J, Károly R, Dörgö G. Event-Tree Based Sequence Mining Using LSTM Deep-Learning Model. Complexity; 2021.
5. Post-processing of association rules;B Baesens;DTEW Research Report,0020