IPLog: An Efficient Log Parsing Method Based on Few-Shot Learning
-
Published:2024-08-21
Issue:16
Volume:13
Page:3324
-
ISSN:2079-9292
-
Container-title:Electronics
-
language:en
-
Short-container-title:Electronics
Author:
Liu Shuxian1, Yun Libo1, Nie Shuaiqi1, Zhang Guiheng1, Li Wei1
Affiliation:
1. School of Computer Science and Technology, Xinjiang University, Urumqi 830046, China
Abstract
Log messages from enterprise-level software systems contain crucial runtime details. Engineers can convert log messages into structured data through log parsing, laying the foundation for downstream tasks such as log anomaly detection. Existing log parsing schemes usually underperform in production environments for several reasons: first, they often ignore the semantics of log messages; second, they are often not adapted to different systems, and their performance varies greatly; and finally, they are difficult to adapt to the complexity and variety of log formats in the real environment. In response to the limitations of current approaches, we introduce IPLog (Intelligent Parse Log), a parsing method designed to address these issues. IPLog samples a limited set of log samples based on the distribution of templates in the system’s historical logs, and allows the model to make full use of the small number of log samples to recognize common patterns of keywords and parameters through few-shot learning, and thus can be easily adapted to different systems. In addition, IPLog can further improve the grouping accuracy of log templates through a novel manual feedback merge query strategy based on the longest common prefix, thus enhancing the model’s adaptability to handle complex log formats in production environments. We conducted experiments on four newly released public log datasets, and the experimental results show that IPLog can achieve an average grouping accuracy (GA) of 0.987 and parsing accuracy (PA) of 0.914 on the four public datasets, which are the best among the mainstream parsing schemes. These results demonstrate that IPLog is effective for log parsing tasks.
Funder
the National Natural Science Foundation of China College Scientific Research Project Plan of Xinjiang Autonomous Region
Reference34 articles.
1. Das, A., Mueller, F., Siegel, C., and Vishnu, A. (2018, January 11–15). Desh: Deep learning for system health prediction of lead times to failure in hpc. Proceedings of the 27th International Symposium on High-Performance Parallel and Distributed Computing, Tempe, AZ, USA. 2. Prefix: Switch failure prediction in datacenter networks;Zhang;Proceedings of the ACM on Measurement and Analysis of Computing Systems,2018 3. Zhang, X., Xu, Y., Lin, Q., Qiao, B., Zhang, H., Dang, Y., Xie, C., Yang, X., Cheng, Q., and Li, Z. (2019, January 26–30). Robust log-based anomaly detection on unstable log data. Proceedings of the ESEC/FSE 2019, Tallinn, Estonia. 4. Zhang, B., Zhang, H., Moscato, P., and Zhang, A. (2020, January 21–24). Anomaly detection via mining numerical workflow relations from logs. Proceedings of the 2020 International Symposium on Reliable Distributed Systems (SRDS), Shanghai, China. 5. Semi-supervised and unsupervised anomaly detection by mining numerical workflow relations from system logs;Zhang;Autom. Softw. Eng.,2023
|
|