Affiliation:
1. University of California San Diego, La Jolla, CA
Abstract
In recent years, configuration errors (i.e., misconfigurations) have become one of the dominant causes of system failures, resulting in many severe service outages and downtime. Unfortunately, it is notoriously difficult for system users (e.g., administrators and operators) to prevent, detect, and troubleshoot configuration errors due to the complexity of the configurations as well as the systems under configuration. As a result, the cost of resolving configuration errors is often tremendous from the aspects of both compensating the service disruptions and diagnosing, recovering from the failures. The prevalence, severity, and cost have made configuration errors one of the most thorny system problems that desire to be addressed.
This survey article provides a holistic and structured overview of the systems approaches that tackle configuration errors. To understand the problem fundamentally, we first discuss the characteristics of configuration errors and the challenges of tackling such errors. Then, we discuss the state-of-the-art systems approaches that address different types of configuration errors in different scenarios. Our primary goal is to equip the stakeholder with a better understanding of configuration errors and the potential solutions for resolving configuration errors in the spectrum of system development and management. To inspire follow-up research, we further discuss the open problems with regard to system configuration. To the best of our knowledge, this is the first survey on the topic of tackling configuration errors.
Publisher
Association for Computing Machinery (ACM)
Subject
General Computer Science,Theoretical Computer Science
Cited by
75 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献
1. Face It Yourselves: An LLM-Based Two-Stage Strategy to Localize Configuration Errors via Logs;Proceedings of the 33rd ACM SIGSOFT International Symposium on Software Testing and Analysis;2024-09-11
2. A case report of system configuration issue in medical imaging due to system upgrade– changes in hardware and software;Frontiers in Digital Health;2024-09-02
3. Rethinking DNS Configuration Verification with a Distributed Architecture;Proceedings of the 8th Asia-Pacific Workshop on Networking;2024-08-03
4. Ctest4J: A Practical Configuration Testing Framework for Java;Companion Proceedings of the 32nd ACM International Conference on the Foundations of Software Engineering;2024-07-10
5. Not as easy as just update: Survey of System Administrators and Patching Behaviours;Proceedings of the CHI Conference on Human Factors in Computing Systems;2024-05-11