Cooperative Multi-Agent Reinforcement Learning-Based Co-optimization of Cores, Caches, and On-chip Network-Reference-Cited by-同舟云学术

Cooperative Multi-Agent Reinforcement Learning-Based Co-optimization of Cores, Caches, and On-chip Network

Published:2017-12-20 Issue:4 Volume:14 Page:1-25
ISSN:1544-3566
Container-title:ACM Transactions on Architecture and Code Optimization
language:en
Short-container-title:ACM Trans. Archit. Code Optim.

Author:

Jain Rahul¹,Panda Preeti Ranjan¹,Subramoney Sreenivas²

Affiliation:

1. Indian Institute of Technology Delhi, New Delhi, India

2. Microarchitecture Research Lab, Intel India, Bangalore, Karnataka

Abstract

Modern multi-core systems provide huge computational capabilities, which can be used to run multiple processes concurrently. To achieve the best possible performance within limited power budgets, the various system resources need to be allocated effectively. Any mismatch between runtime resource requirement and allocation leads to a sub-optimal energy-delay product (EDP). Different optimization techniques exist for addressing the problem of mismatch between the dynamic requirement and runtime allocation of the system resources. Choosing between multiple optimizations at runtime is complex due to the non-additive effects, making the scenario suitable for the application of machine learning techniques. We present a novel method, Machine Learned Machines (MLM), by using online reinforcement learning (RL) to perform dynamic partitioning of the last level cache (LLC), along with dynamic voltage and frequency scaling (DVFS) of the core and uncore (interconnection network and LLC). We have proposed and evaluated three different MLM co-optimization techniques based on independent and cooperative multi-agent learners. We show that the co-optimization results in a much lower system EDP than any of the techniques applied individually. We explore various RL models targeted toward optimization of different system metrics and study their effects on a system EDP, system throughput (STP), and Fairness. The various proposed techniques have been extensively evaluated with a mix of 20 workloads on a 4-core system using Spec2006 benchmarks. We have further evaluated our cooperative MLM techniques on a 16-core system. The results show an average of 20.5% and 19.1% system EDP improvement on a 4-core and 16-core system, respectively, with limited degradation of STP and Fairness.

Publisher

Association for Computing Machinery (ACM)

Subject

Hardware and Architecture,Information Systems,Software

Link

https://dl.acm.org/doi/pdf/10.1145/3132170

Reference46 articles.

1. Analysis of dynamic power management on multi-core processors

2. Core-Level Activity Prediction for Multicore Power Management

3. Coordinated management of multiple interacting resources in chip multiprocessors: A machine learning approach

4. Predictive coordination of multiple on-chip resources for chip multiprocessors

5. Modeling program resource demand using inherent program characteristics

Cited by 7 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. A survey on multi-agent reinforcement learning and its application;Journal of Automation and Intelligence;2024-06

2. RL-CoPref: a reinforcement learning-based coordinated prefetching controller for multiple prefetchers;The Journal of Supercomputing;2024-02-27

3. NPU-Accelerated Imitation Learning for Thermal Optimization of QoS-Constrained Heterogeneous Multi-Cores;ACM Transactions on Design Automation of Electronic Systems;2023-11-15

4. A Fairness-Aware Cooperation Strategy for Multi-Agent Systems Driven by Deep Reinforcement Learning;2022 41st Chinese Control Conference (CCC);2022-07-25

5. Bringing Fairness to Actor-Critic Reinforcement Learning for Network Utility Optimization;IEEE INFOCOM 2021 - IEEE Conference on Computer Communications;2021-05-10