Affiliation:
1. Naveen Jindal School of Management, The University of Texas at Dallas, Richardson, Texas 75080
Abstract
A Near-Optimal Capacity and Scheduling Policy for Cloud Users In “Cloud Cost Optimization: Model, Bounds, and Asymptotics,” Qu, Dawande, and Janakiraman study a long-term, dynamic resource-optimization problem for firms managing various cloud resources to process incoming computing tasks over time. Cloud resources vary in attributes, such as computing speed, memory, accelerators, and storage, tailored to different computing tasks, such as data warehousing, scientific computing, machine learning, and data processing. Firms can choose reserved resources for long-term commitments at lower costs or on-demand resources for flexibility at a higher price. Firms face three key trade-offs: the cost disparity between reserved and on-demand resources, the variation in resource attributes affecting performance and cost, and the challenge of balancing delay and resource costs. We propose an asymptotically optimal policy, demonstrating its effectiveness through a detailed numerical study based on Amazon Web Services data.
Publisher
Institute for Operations Research and the Management Sciences (INFORMS)
Subject
Management Science and Operations Research,Computer Science Applications