Optimizing the agent decisions for a Cloud actuator using Deep reinforcement learning

Author:

Sankaran Lakshmi1,JS Saleema2,Suleiman Basem3

Affiliation:

1. CMR University

2. Christ University

3. The University of Sydney

Abstract

Abstract With the increasing use of deep reinforcement learning (DRL) techniques to build intelligent systems, the application of it to real-world problems is rampant. Resource allocation in a cloud environment that need dynamic and auto-scaling features is evolving. The agent-based decisions that are offered by DRL are in use by software robotics. Auto-scaling of resources in cloud applications introduces intelligence to agents thus built by these DRL techniques. Markov decision process as a tool minimizes the target rewards to agents such that auto-scaling of applications is performed by agent decisions. Analysis of optimizing the convergence errors that are measured while the agent performs in an online environment is the challenge. Speedy Q-learning (SQL), Generalized SQL(GSQL) algorithm variants relax the parameter values of convergence with a model-free space. The authors applied heuristic values for one such relaxation parameter in our experiments. The study is an extension of works that introduced GSQL-w, where w is the convergence parameter. The authors designed a new GSQL-wh algorithm that heuristically fixes a value for w optimally in cases with over-utilization of resources. This is presented as a novel solution in this study for cloud resource workloads.

Publisher

Research Square Platform LLC

Reference13 articles.

1. D. Edsinger, “Auto-scaling cloud infrastructure with Reinforcement Learning A comparison between multiple RL algorithms to auto-scale resources in cloud infrastructure”, Chalmers University of Technology, Sweden 2018.

2. C. Bitsakos, I. Konstantinou, and N. Koziris, “DERP: A Deep Reinforcement Learning Cloud System for Elastic Resource Provisioning,” in 2018 IEEE International Conference on Cloud Computing Technology and Science (CloudCom), vol. 2018-Decem, Dec 2018, pp. 21–29,10.1109/CloudCom2018.2018.00020.

3. I. John and S. Bhatnagar, “Deep Reinforcement Learning with Successive Over-Relaxation and its Application in Autoscaling Cloud Resources,” International Joint Conference on Neural Networks (IJCNN), 2020, pp. 1–6, 10.1109/IJCNN48605.2020.9206598.

4. P. Singh, P. Gupta, K. Jyoti, and A. Nayyar, “Research on Auto-Scaling of Web Applications in Cloud: Survey, Trends and Future Directions,” Scalable Comput. Pract. Exp., vol. 20, no. 2, pp. 399–432, May 2019,10.12694/scope.v20i2.1537.

5. I. John, C. Kamanchi, and S. Bhatnagar, “Generalized Speedy Q-Learning,” IEEE Control Syst. Lett., vol. 4, no. 3, pp. 524–529, Jul. 2020,10.1109/LCSYS.2020.2970555.

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3