A novel approach to workload prediction using attention-based LSTM encoder-decoder network in cloud environment

Published:2019-12 Issue:1 Volume:2019
ISSN:1687-1499
Container-title:EURASIP Journal on Wireless Communications and Networking
language:en
Short-container-title:J Wireless Com Network

Author:

Zhu Yonghua, Zhang Weilin, Chen Yihai, Gao Honghao

Abstract

Server workload in cloud clusters is a key factor in server maintenance and task scheduling, so balancing and optimizing hardware and computation resources deserves more attention. However, we have observed that the disordered execution of running applications and batch jobs seriously reduces server efficiency. To improve workload prediction accuracy, this paper proposes an approach based on a long short-term memory (LSTM) encoder-decoder network with an attention mechanism. First, the encoder network extracts the sequential and contextual features of the historical workload data. Second, the attention mechanism is integrated into the decoder network, which produces the predictions for batch workloads. Third, experiments on the Alibaba and Dinda workload trace datasets demonstrate that the method achieves state-of-the-art performance in mixed workload prediction in cloud computing environments. Furthermore, we propose a scroll prediction method, which splits a long prediction sequence into several short sequences so that prediction accuracy can be monitored and controlled. This work helps to dynamically guide configuration for workload balancing.
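The scroll prediction idea in the abstract (splitting a long forecast into several short segments so that accuracy can be checked along the way) can be illustrated with a minimal Python sketch. This is not the authors' implementation: the function names `scroll_predict` and `mean_predictor`, the `chunk_size` and `window` parameters, the use of mean absolute error, and the choice to feed observed values back into the input window after each segment are all assumptions made for illustration. A trivial mean-based placeholder stands in for the attention-based LSTM encoder-decoder.

```python
from typing import Callable, List, Sequence, Tuple

# Hypothetical predictor signature: (input window, number of steps) -> forecasts.
Predictor = Callable[[Sequence[float], int], List[float]]


def scroll_predict(
    history: Sequence[float],
    actual_future: Sequence[float],
    predict_chunk: Predictor,
    chunk_size: int,
    window: int,
) -> Tuple[List[float], List[float]]:
    """Forecast a long horizon as a series of short segments.

    After each segment, the observed values are appended to the context
    (an assumption of this sketch) and the segment's mean absolute error
    is recorded so that accuracy can be monitored per segment.
    """
    context = list(history)
    predictions: List[float] = []
    chunk_errors: List[float] = []
    for start in range(0, len(actual_future), chunk_size):
        observed = list(actual_future[start:start + chunk_size])
        # Predict only as many steps as this segment contains.
        predicted = predict_chunk(context[-window:], len(observed))
        mae = sum(abs(p - o) for p, o in zip(predicted, observed)) / len(observed)
        predictions.extend(predicted)
        chunk_errors.append(mae)
        # Roll the window forward using what was actually observed.
        context.extend(observed)
    return predictions, chunk_errors


def mean_predictor(window_values: Sequence[float], steps: int) -> List[float]:
    """Placeholder model: repeat the mean of the input window."""
    mean = sum(window_values) / len(window_values)
    return [mean] * steps


history = [0.25, 0.5, 0.75, 0.5]
future = [1.0, 1.0, 0.0, 0.0, 0.5]
preds, errors = scroll_predict(history, future, mean_predictor, chunk_size=2, window=4)
print(preds)   # one forecast per future step
print(errors)  # one error value per segment
```

A per-segment error list like `errors` is what makes monitoring possible: a caller could, for example, shorten `chunk_size` or trigger retraining when a segment's error exceeds a chosen threshold, although the abstract does not say how the paper acts on these errors.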

Funder

National Key Research and Development Plan of China

Natural Science Foundation of Jilin Province

Publisher

Springer Science and Business Media LLC

Subject

Computer Networks and Communications, Computer Science Applications, Signal Processing

Link

http://link.springer.com/content/pdf/10.1186/s13638-019-1605-z.pdf

References: 47 articles.

1. Joseph AD, Katz RA, Konwinski A, Lee G, Patterson D, Rabkin A. A view of cloud computing. Commun ACM. 2010;53.

2. Q. Zhang, L. Cheng, R. Boutaba, Cloud computing: state-of-the-art and research challenges. J Internet Serv Appl. 1, 7–18 (2010)

3. Rajan K, Kakadia D, Curino C, Krishnan S. PerfOrator: eloquent performance models for resource optimization. In: Proceedings of the Seventh ACM Symposium on Cloud Computing. ACM; 2016. p. 415–27.

4. Lianyong Qi, Jiguo Yu, Zhili Zhou. An invocation cost optimization method for web services in cloud environment. Scientific Programming, Volume 2017, Article ID 4358536, 9 pages, 2017.

5. L. Yang, I.T. Foster, J.M. Schopf, Homeostatic and tendency-based CPU load predictions, in International Parallel and Distributed Processing Symposium (2003), p. 42

Cited by 103 articles.

1. When wavelet decomposition meets external attention: a lightweight cloud server load prediction model;Journal of Cloud Computing;2024-08-20

2. A common feature-driven prediction model for multivariate time series data;Information Sciences;2024-08

3. Trust Management and Resource Optimization in Edge and Fog Computing Using the CyberGuard Framework;Sensors;2024-07-02

4. HMM-CPM: a cloud instance resource prediction method tracing the workload trends via hidden Markov model;Cluster Computing;2024-06-06

5. Toward Using Representation Learning for Cloud Resource Usage Forecasting;Proceedings of the 2024 Workshop on AI For Systems;2024-06-03