An Empirical Investigation of Different Classifiers, Encoding, and Ensemble Schemes for Next Event Prediction Using Business Process Event Logs-Reference-Cited by-同舟云学术

An Empirical Investigation of Different Classifiers, Encoding, and Ensemble Schemes for Next Event Prediction Using Business Process Event Logs

Published:2020-12-31 Issue:6 Volume:11 Page:1-34
ISSN:2157-6904
Container-title:ACM Transactions on Intelligent Systems and Technology
language:en
Short-container-title:ACM Trans. Intell. Syst. Technol.

Author:

Tama Bayu Adhi¹^ORCID,Comuzzi Marco²^ORCID,Ko Jonghyeon²

Affiliation:

1. Department of Mechanical Engineering, Pohang University of Science and Technology (POSTECH)

2. School of Management Engineering, Ulsan National Institute of Science and Technology (UNIST)

Abstract

There is a growing need for empirical benchmarks that support researchers and practitioners in selecting the best machine learning technique for given prediction tasks. In this article, we consider the next event prediction task in business process predictive monitoring, and we extend our previously published benchmark by studying the impact on the performance of different encoding windows and of using ensemble schemes. The choice of whether to use ensembles and which scheme to use often depends on the type of data and classification task. While there is a general understanding that ensembles perform well in predictive monitoring of business processes, next event prediction is a task for which no other benchmarks involving ensembles are available. The proposed benchmark helps researchers to select a high-performing individual classifier or ensemble scheme given the variability at the case level of the event log under consideration. Experimental results show that choosing an optimal number of events for feature encoding is challenging, resulting in the need to consider each event log individually when selecting an optimal value. Ensemble schemes improve the performance of low-performing classifiers in this task, such as SVM, whereas high-performing classifiers, such as tree-based classifiers, are not better off when ensemble schemes are considered.

Publisher

Association for Computing Machinery (ACM)

Subject

Artificial Intelligence,Theoretical Computer Science

Link

https://dl.acm.org/doi/pdf/10.1145/3406541

Reference71 articles.

1. A comparative study on base classifiers in ensemble methods for credit scoring

2. Improving experimental studies about ensembles of classifiers for bankruptcy prediction and credit scoring

3. Building classification trees using the total uncertainty criterion

4. An introduction to kernel and nearest-neighbor nonparametric regression;Altman Naomi S.;Am. Stat.,1992

Cited by 7 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Predictive business process monitoring with AutoML for next activity prediction;Intelligent Decision Technologies;2024-07-23

2. Predictive Process Mining a Systematic Literature Review;Lecture Notes in Networks and Systems;2024

3. Automatically reconciling the trade-off between prediction accuracy and earliness in prescriptive business process monitoring;Information Systems;2023-09

4. Leveraging a Heterogeneous Ensemble Learning for Outcome-Based Predictive Monitoring Using Business Process Event Logs;Electronics;2022-08-15

5. Keeping our rivers clean: Information-theoretic online anomaly detection for streaming business process events;Information Systems;2022-02