Affiliation:
1. BIBA-Bremer Institut für Produktion und Logistik GmbH, Hochschulring 20, D-28359 Bremen, Germany
2. Department of Integrated Product Development, University of Bremen, Bibliothekstraße 1, D-28359 Bremen, Germany
Abstract
Within the integration and development of data-driven process models, the underlying process is digitally mapped in a model through sensory data acquisition and subsequent modelling. In this process, challenges of different types and degrees of severity arise in each modelling step, according to the Cross-Industry Standard Process for Data Mining (CRISP-DM). Particularly in the context of data acquisition and integration into the process model, it can be assumed with a sufficiently high degree of probability that the acquired data contain anomalies of various kinds. The outliers must be detected in the data preparation and processing phase and dealt with accordingly. If this is sufficiently implemented, it will positively impact the subsequent modelling in terms of accuracy and precision. Therefore, this paper shows how outliers can be identified using the unsupervised machine learning methods autoencoder, Density-Based Spatial Clustering of Applications with Noise (DBSCAN), Isolation Forest (iForest), and One-Class Support Vector Machine (OCSVM). Following implementing these methods, we compared them by applying the Numenta Anomaly Benchmark (NAB) and sufficiently presented the individual strengths and disadvantages. Evaluating the correctness, distinctiveness and robustness criteria described in the paper showed that the One-Class Support Vector Machine was outstanding among the methods considered. This is because the OCSVM achieved acceptable anomaly detections on the available process datasets with comparatively little effort.
Funder
German Federal Ministry for Digital and Transport (BMDV) in the ”Innovative Port Technologies” (IHATEC II) program
Subject
Computer Networks and Communications,Human-Computer Interaction
Reference47 articles.
1. Smart Use Case Picking with DUCAR: A Hands-On Approach for a Successful Integration of Machine Learning in Production Processes;Mayr;Procedia Manuf.,2020
2. Outlier detection: Applications and techniques;Singh;Int. J. Comput. Sci. Issues (IJCSI),2012
3. Schindler, T.F., Bode, D., and Thoben, K.D. (2022, January 7–9). Towards Challenges and Proposals for Integrating and Using Machine Learning Methods in Production Environments. Proceedings of the International Conference on System-Integrated Intelligence, Genova, Italy.
4. Lavin, A., and Ahmad, S. (2015, January 9–11). Evaluating Real-Time Anomaly Detection Algorithms – The Numenta Anomaly Benchmark. Proceedings of the 2015 IEEE 14th International Conference on Machine Learning and Applications (ICMLA), Miami, FL, USA.
5. Freeman, C., Merriman, J., Beavers, I., and Mueen, A. (2019, January 19–22). Experimental Comparison of Online Anomaly Detection Algorithms. Proceedings of the Thirty-Second International Flairs Conference, Sarasota, FL, USA.
Cited by
1 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献
1. Comparison of Dimensional Reduction Methods for Predictive Analysis of Railway System Data;2024 9th International Conference on Smart and Sustainable Technologies (SpliTech);2024-06-25