SiaLog: detecting anomalies in software execution logs using the siamese network-Reference-Cited by-同舟云学术

SiaLog: detecting anomalies in software execution logs using the siamese network

Published:2022-10-13 Issue:2 Volume:29 Page:
ISSN:0928-8910
Container-title:Automated Software Engineering
language:en
Short-container-title:Autom Softw Eng

Author:

Hashemi Shayan^ORCID,Mäntylä Mika

Abstract

AbstractDetecting anomalies in software logs has become a notable concern for software engineers and maintainers as they represent anomalies in software execution paths and states. This paper propose a novel anomaly detection approach based on the Siamese network on top of Recurrent Neural Networks(RNN). Accordingly, we introduce a novel training pair generation algorithm to train the Siamese network which reduces generated training significantly while maintaining the

$$F_1$$

F 1 score. Additionally, we propose a hybrid model by combining the Siamese network with a traditional feedforward neural network to make end-to-end training possible, reducing engineering effort in setting up a deep-learning-based log anomaly detector. Furthermore, we provides validations of the approach on the Hadoop Distributed File System (HDFS), Blue Gene/L (BGL), and Hadoop map-reduce task log datasets. To the best of our knowledge, the proposed approach outperforms other methods on the same dataset at the

$$F_1$$

F 1 scores of respectively 0.99, 0.99, and 0.94 on HDFS, BGL, and Hadoop datasets, resulting in a new state-of-the-art performance.To further evaluate the proposed method, we examine our method’s robustness to log evolutions by evaluating the model on synthetically evolved log sequences; we got the

$$F_1$$

F 1 score of 0.95 on the HDFS dataset at the noise ratio of

$$20\%$$

20 % . Finally, we dive deep into some of the side benefits of the Siamese network. Accordingly, we introduce an unsupervised log evolution monitoring method alongside a visualization technique that facilitates model interpretability.

Funder

Academy of Finland

University of Oulu including Oulu University Hospital

Publisher

Springer Science and Business Media LLC

Subject

Software

Link

https://link.springer.com/content/pdf/10.1007/s10515-022-00365-7.pdf

Reference47 articles.

1. Abdi, Hervé, Williams, Lynne J.: Principal component analysis. Wiley Interdiscip. Rev. Comput. Stat. 2(4), 433–459 (2010)

2. Ahrabian, Kian, BabaAli, Bagher: Usage of autoencoders and siamese networks for online handwritten signature verification. Neural Comput. Appl. 31(12), 9321–9334 (2019)

3. Alhersh, T., Stuckenschmidt, H.: On the combination of imu and optical flow for action recognition. In: 2019 IEEE International Conference on Pervasive Computing and Communications Workshops (PerCom Workshops), pages 17–21. IEEE, (2019)

4. Bahdanau, D., Cho, K., Bengio, Y.: Neural machine translation by jointly learning to align and translate. arXiv preprintarXiv:1409.0473, (2014)

5. Bertinetto, L., Valmadre, J., Henriques, J. F., Vedaldi, A., Torr, P. H.: Fully-convolutional siamese networks for object tracking. In: European conference on computer vision, pages 850–865. Springer, (2016)

Cited by 6 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. A literature review and existing challenges on software logging practices;Empirical Software Engineering;2024-06-18

2. OneLog: towards end-to-end software log anomaly detection;Automated Software Engineering;2024-04-16

3. LogPM: Character-Based Log Parser Benchmark;2024 IEEE International Conference on Software Analysis, Evolution and Reengineering (SANER);2024-03-12

4. Drug-Target-Interaction Prediction with Contrastive and Siamese Transformers;2023-10-31

5. LGLog: Semi-supervised Graph Representation Learning for Anomaly Detection based on System Logs;2023 IEEE 23rd International Conference on Software Quality, Reliability, and Security (QRS);2023-10-22