Author:
Katal Avita, Dahiya Susheela, Choudhury Tanupriya
Abstract
Advancements in virtualization technology have led to better utilization of existing infrastructure. Virtualization allows numerous virtual machines with different workloads to coexist on the same physical server, resulting in a pool of server resources. Understanding enterprise workloads is critical to correctly creating and configuring existing and future support in such pools. Managing resources in a cloud data center is one of the most difficult tasks, owing to the dynamic nature of the cloud environment and its high level of uncertainty. The diverse Quality of Service (QoS) requirements of the hosted applications make data center management harder still. Accurate forecasting of future resource demand is required to meet QoS needs and ensure better resource utilization. Consequently, data center workload modeling and categorization are needed to provide software quality solutions cost-effectively. This paper uses traces of Bitbrain’s data to characterize and categorize workload. Clustering (K-Means and Gaussian Mixture Model, GMM) and classification strategies (K-Nearest Neighbors, Logistic Regression, Decision Trees, Random Forest, and Support Vector Machine) are used to characterize and model the workload traces. K-Means shows better results than GMM when the two are compared using the Calinski-Harabasz index and the Davies-Bouldin score. The results show that the Decision Tree achieves the maximum accuracy of 99.18%, followed by K-Nearest Neighbors (KNN), Random Forest (RF), Support Vector Machine (SVM), Logistic Regression (LR), Multi-Layer Perceptron (MLP), and Back Propagation Neural Networks.
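The abstract names the Calinski-Harabasz index and the Davies-Bouldin score as the criteria for comparing clusterings but does not spell them out. The following is a minimal sketch of the standard textbook definitions of both metrics, not the authors' implementation. The sample points (imagined as CPU and memory utilization percentages) and both labelings are hypothetical and are not taken from the workload traces. A higher Calinski-Harabasz value and a lower Davies-Bouldin value indicate tighter, better-separated clusters.

```python
import math
from collections import defaultdict


def _centroid(points):
    """Component-wise mean of a list of equal-length points."""
    n = len(points)
    return [sum(col) / n for col in zip(*points)]


def _group(points, labels):
    """Map each cluster label to the list of points assigned to it."""
    clusters = defaultdict(list)
    for point, label in zip(points, labels):
        clusters[label].append(point)
    return clusters


def calinski_harabasz(points, labels):
    """Ratio of between-cluster to within-cluster dispersion (higher is better)."""
    clusters = _group(points, labels)
    n, k = len(points), len(clusters)
    overall = _centroid(points)
    between = within = 0.0
    for members in clusters.values():
        center = _centroid(members)
        between += len(members) * math.dist(center, overall) ** 2
        within += sum(math.dist(p, center) ** 2 for p in members)
    return (between / (k - 1)) / (within / (n - k))


def davies_bouldin(points, labels):
    """Mean worst-case similarity between each cluster and the others (lower is better)."""
    clusters = _group(points, labels)
    centers = {lab: _centroid(m) for lab, m in clusters.items()}
    # Scatter: average distance from a cluster's members to its centroid.
    scatter = {
        lab: sum(math.dist(p, centers[lab]) for p in m) / len(m)
        for lab, m in clusters.items()
    }
    total = 0.0
    for i in clusters:
        total += max(
            (scatter[i] + scatter[j]) / math.dist(centers[i], centers[j])
            for j in clusters
            if j != i
        )
    return total / len(clusters)


# Hypothetical (CPU %, memory %) samples: three lightly loaded and three heavily loaded VMs.
points = [(10, 20), (12, 22), (11, 19), (80, 70), (82, 75), (78, 72)]
good_labels = [0, 0, 0, 1, 1, 1]  # separates light from heavy load
poor_labels = [0, 1, 0, 1, 0, 1]  # mixes the two groups

for name, labels in (("good", good_labels), ("poor", poor_labels)):
    print(name, calinski_harabasz(points, labels), davies_bouldin(points, labels))
```

In practice, scikit-learn exposes the same metrics as `sklearn.metrics.calinski_harabasz_score` and `sklearn.metrics.davies_bouldin_score`; the pure-Python version here only makes the definitions explicit.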
Publisher
Universiti Putra Malaysia
Subject
General Earth and Planetary Sciences, General Environmental Science