Affiliation:
1. University of Massachusetts, Amherst, MA
2. IBM T. J. Watson Research Center, Hawthorne, NY
Abstract
Since many Internet applications employ a multi-tier architecture, in this paper, we focus on the problem of analytically modeling the behavior of such applications. We present a model based on a network of queues, where the queues represent different tiers of the application. Our model is sufficiently general to capture (i) the behavior of tiers with significantly different performance characteristics and (ii) application idiosyncrasies such as session-based workloads, concurrency limits, and caching at intermediate tiers. We validate our model using real multi-tier applications running on a Linux server cluster. Our experiments indicate that our model faithfully captures the performance of these applications for a number of workloads and configurations. For a variety of scenarios, including those with caching at one of the application tiers, the average response times predicted by our model were within the 95% confidence intervals of the observed average response times. Our experiments also demonstrate the utility of the model for dynamic capacity provisioning, performance prediction, bottleneck identification, and session policing. In one scenario, where the request arrival rate increased from less than 1500 to nearly 4200 requests/min, a dynamic provisioning technique employing our model was able to maintain response time targets by increasing the capacity of two of the application tiers by factors of 2 and 3.5, respectively.
Publisher
Association for Computing Machinery (ACM)
Subject
Computer Networks and Communications,Hardware and Architecture,Software
Reference20 articles.
1. Performance guarantees for Web server end-systems: a control-theoretical approach
2. Dynaserver project. http://compsci.rice.edu/CS/Systems/DynaServer/. Dynaserver project. http://compsci.rice.edu/CS/Systems/DynaServer/.
3. An approach to predictive detection for service management
Cited by
60 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献
1. Learning optimal admission control in partially observable queueing networks;Queueing Systems;2024-06-29
2. Prediction of Cloud API Performance Using Uncertainty-Based Fusion of Predictive and Analytical Modeling;2023 IEEE International Conference on High Performance Computing & Communications, Data Science & Systems, Smart City & Dependability in Sensor, Cloud & Big Data Systems & Application (HPCC/DSS/SmartCity/DependSys);2023-12-17
3. Exploring nonintrusive measurements of spatio‐temporal portrait of microservices;Software: Practice and Experience;2023-06-27
4. An In-Depth Study of Microservice Call Graph and Runtime Performance;IEEE Transactions on Parallel and Distributed Systems;2022-12-01
5. State Space Model and Queuing Network Based Cloud Resource Provisioning for Meshed Web Systems;IEEE Transactions on Parallel and Distributed Systems;2022-12-01