Affiliation:
1. Department of Computer Science, University of Engineering & Technology (UET), Lahore 54890, Pakistan
2. Artificial Intelligence and Data Analytics Laboratory, College of Computer and Information Sciences (CCIS), Prince Sultan University, Riyadh 11586, Saudi Arabia
Abstract
Modern distributed systems that operate concurrently generate interleaved logs. Identifiers (IDs) are associated with active instances or entities so that they can be tracked in logs. Consequently, log messages with matching IDs can be grouped to aid in the detection and localization of anomalies. Current methods for achieving this do not overcome the following obstacles: (1) Log processing is performed in a component separate from log mining. (2) In modern software systems, log formats evolve continuously. (3) It is hard to detect latent technical issues non-intrusively using simple monitoring techniques. To address these drawbacks, we present a reliable and consistent method for the detection and localization of anomalies in interleaved unstructured logs. This research examines Log Sequential Anomalies (LSA) for potential performance issues. IDs are used to group log messages, and ID relation graphs are constructed between distributed components. In addition, we offer a data-driven online log parser that does not require any parameters. Using this novel log parser, the grouped log messages are transformed through both semantic and temporal embedding. To identify instance-granularity anomalies, this study uses a heuristic searching technique and an attention-based Bi-LSTM model. Experiments on real-world and synthetic datasets support the effectiveness, efficiency, and robustness of the proposed method. The neural network improves the F1 score by five percent over other cutting-edge models.
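The ID-based grouping step described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the identifier pattern (`blk_<n>` or `req-<n>`), the function name `group_by_id`, and the sample log lines are all assumptions made for illustration only.

```python
import re
from collections import defaultdict

# Assumed identifier format for this sketch: tokens like "blk_1" or "req-42".
# Real systems use their own ID formats; this pattern is hypothetical.
ID_PATTERN = re.compile(r"\b(?:blk_|req-)\d+\b")

def group_by_id(lines):
    """Group interleaved log lines by each identifier they mention."""
    groups = defaultdict(list)
    for line in lines:
        # A line may mention several IDs; add it once to each ID's group.
        for ident in sorted(set(ID_PATTERN.findall(line))):
            groups[ident].append(line)
    return dict(groups)

# Hypothetical interleaved log lines from two concurrent instances.
logs = [
    "Receiving block blk_1 from node A",
    "Receiving block blk_2 from node B",
    "Received block blk_1 of size 64",
    "Deleting block blk_2",
]
groups = group_by_id(logs)
```

Each resulting group preserves the original message order for one ID, which is the kind of per-instance sequence a sequential anomaly detector would consume.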
Subject
Fluid Flow and Transfer Processes, Computer Science Applications, Process Chemistry and Technology, General Engineering, Instrumentation, General Materials Science