Author:
Raeis Majid, Tizghadam Ali, Leon-Garcia Alberto
Abstract
End-to-end delay is a critical attribute of quality of service (QoS) in application domains such as cloud computing and computer networks. This metric is particularly important in tandem service systems, where the end-to-end service is provided through a chain of services. Service-rate control is a common mechanism for providing QoS guarantees in service systems. In this paper, we introduce a reinforcement learning-based (RL-based) service-rate controller that provides probabilistic upper bounds on the end-to-end delay of the system while preventing the overuse of service resources. To keep the framework general, we use queueing theory to model the service systems. However, we adopt an RL-based approach to avoid the limitations of queueing-theoretic methods. In particular, we use Deep Deterministic Policy Gradient (DDPG) to learn the service rates (action) as a function of the queue lengths (state) in tandem service systems. In contrast to existing RL-based methods that quantify their performance by the achieved overall reward, which can be hard to interpret or even misleading, our proposed controller provides explicit probabilistic guarantees on the end-to-end delay of the system. The evaluations are presented for a tandem queueing system with non-exponential inter-arrival and service times, and the results validate our controller's capability in meeting QoS constraints.
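To make the notion of a probabilistic upper bound on end-to-end delay concrete, the following is a minimal sketch of how such a bound could be checked empirically for a tandem queue. It is not the authors' DDPG controller: the service rates are held fixed rather than learned from queue lengths, the gamma-distributed inter-arrival and service times are only one assumed example of non-exponential timing, and the names `simulate_tandem` and `violation_probability` and all parameter values are hypothetical.

```python
import random


def simulate_tandem(service_rates, arrival_rate, n_jobs, shape=2.0, seed=0):
    """Simulate a FIFO tandem of single-server queues.

    Assumption: inter-arrival and service times are gamma distributed
    with the given shape, scaled so their means are 1/arrival_rate and
    1/mu. Returns the end-to-end delay of each job.
    """
    rng = random.Random(seed)
    # Time at which each stage last finished serving a job.
    last_departure = [0.0] * len(service_rates)
    arrival_time = 0.0
    delays = []
    for _ in range(n_jobs):
        arrival_time += rng.gammavariate(shape, 1.0 / (shape * arrival_rate))
        ready = arrival_time  # time the job becomes available to the next stage
        for k, mu in enumerate(service_rates):
            # Service starts once the job has arrived and the server is free.
            start = max(ready, last_departure[k])
            ready = start + rng.gammavariate(shape, 1.0 / (shape * mu))
            last_departure[k] = ready
        delays.append(ready - arrival_time)
    return delays


def violation_probability(delays, d_max):
    """Empirical estimate of P(end-to-end delay > d_max)."""
    return sum(d > d_max for d in delays) / len(delays)


if __name__ == "__main__":
    # Hypothetical setting: two stages, delay bound of 3 time units.
    delays = simulate_tandem([2.0, 2.0], arrival_rate=1.0, n_jobs=20000)
    print("estimated P(delay > 3.0):", violation_probability(delays, 3.0))
```

A guarantee of the form P(delay > d_max) ≤ ε would hold in this sketch when the estimated violation probability stays below the chosen ε. A service-rate controller of the kind described in the abstract would aim for the lowest rates that still satisfy this constraint, since higher rates consume more service resources.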
Publisher
Association for the Advancement of Artificial Intelligence (AAAI)
Cited by
8 articles.
1. Scheduling of Low-Latency Medical Services in Healthcare Cloud with Deep Reinforcement Learning;Tsinghua Science and Technology;2025-02
2. An Optimal Admission Control Policy for Cloud Computing Services with Tandem Queues Based on Game Theory;2024 27th International Conference on Computer Supported Cooperative Work in Design (CSCWD);2024-05-08
3. Queue-Learning-Based QoE Optimization for Super-Resolution-Assisted Adaptive Video Streaming;GLOBECOM 2023 - 2023 IEEE Global Communications Conference;2023-12-04
4. Deep Reinforcement Learning for Power Control in Secure Broadcast Channels;2023 21st International Symposium on Modeling and Optimization in Mobile, Ad Hoc, and Wireless Networks (WiOpt);2023-08-24
5. PlanIoT: A Framework for Adaptive Data Flow Management in IoT-enhanced Spaces;2023 IEEE/ACM 18th Symposium on Software Engineering for Adaptive and Self-Managing Systems (SEAMS);2023-05