Abstract
Identifying and anticipating potential failures in the cloud is an effective way to increase cloud reliability and enable proactive failure management. Many studies have sought to predict potential failures, but none have combined SMART (self-monitoring, analysis, and reporting technology) hard drive metrics with other system metrics, such as central processing unit (CPU) utilisation. We therefore propose a failure prediction approach, based on artificial intelligence, that combines these system metrics to improve reliability. We evaluated four artificial intelligence algorithms (random forest, gradient boosting, long short-term memory, and gated recurrent unit) on data from over 100 cloud servers and also performed a correlation analysis. The correlation analysis sheds light on the relationships between system metrics and failure, and the experimental results demonstrate the advantages of combining system metrics, with the combined approach outperforming the state of the art.
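The idea of combining SMART metrics with other system metrics and correlating them with failure can be illustrated with a minimal sketch. It is not the authors' pipeline: the metric names (`reallocated_sectors`, `cpu_util`), the server identifiers, and every value are hypothetical and chosen only to show the join and a Pearson correlation against a failure label.

```python
from math import sqrt

# Hypothetical SMART readings keyed by (server_id, day). Values are made up.
smart = {
    ("srv-a", 1): {"reallocated_sectors": 0},
    ("srv-a", 2): {"reallocated_sectors": 0},
    ("srv-b", 1): {"reallocated_sectors": 2},
    ("srv-b", 2): {"reallocated_sectors": 8},
    ("srv-c", 1): {"reallocated_sectors": 0},
    ("srv-c", 2): {"reallocated_sectors": 1},
    ("srv-d", 1): {"reallocated_sectors": 3},  # no system metrics: dropped by the join
}

# Hypothetical system metrics for the same keys (CPU utilisation as a fraction).
system = {
    ("srv-a", 1): {"cpu_util": 0.30},
    ("srv-a", 2): {"cpu_util": 0.35},
    ("srv-b", 1): {"cpu_util": 0.70},
    ("srv-b", 2): {"cpu_util": 0.95},
    ("srv-c", 1): {"cpu_util": 0.40},
    ("srv-c", 2): {"cpu_util": 0.45},
}

# Hypothetical failure labels: 1 if the server failed on that day, else 0.
failed = {
    ("srv-a", 1): 0, ("srv-a", 2): 0,
    ("srv-b", 1): 0, ("srv-b", 2): 1,
    ("srv-c", 1): 0, ("srv-c", 2): 0,
}


def combine(smart_rows, system_rows, labels):
    """Inner-join the three sources on their shared (server_id, day) keys."""
    shared = sorted(smart_rows.keys() & system_rows.keys() & labels.keys())
    return [
        {**smart_rows[key], **system_rows[key], "failure": labels[key]}
        for key in shared
    ]


def pearson(xs, ys):
    """Pearson correlation coefficient; undefined if either input is constant."""
    n = len(xs)
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    return cov / sqrt(var_x * var_y)


def failure_correlations(rows):
    """Correlate every combined metric with the failure label."""
    target = [row["failure"] for row in rows]
    features = [name for name in rows[0] if name != "failure"]
    return {name: pearson([row[name] for row in rows], target) for name in features}


rows = combine(smart, system, failed)
correlations = failure_correlations(rows)
for name, value in correlations.items():
    print(f"{name}: r = {value:.3f}")
```

The combined rows could then serve as the feature table for any of the classifiers named in the abstract. The sketch stops at the correlation step because the abstract does not specify model settings.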
Subject
Artificial Intelligence, Computer Science Applications, Information Systems, Management Information Systems
Cited by
5 articles.