A cost-effective load-balancing policy for tile-based, massive multi-core packet processors-Reference-Cited by-同舟云学术

A cost-effective load-balancing policy for tile-based, massive multi-core packet processors

Published:2010-02 Issue:3 Volume:9 Page:1-25
ISSN:1539-9087
Container-title:ACM Transactions on Embedded Computing Systems
language:en
Short-container-title:ACM Trans. Embed. Comput. Syst.

Author:

Musoll Enric¹

Affiliation:

1. ConSentry Networks, Milpitas, CA

Abstract

Massive multi-core architectures provide a computation platform with high processing throughput, enabling the efficient processing of workloads with a significant degree of thread-level parallelism found in networking environments. Communication-centric workloads, like those in LAN and WAN environments, are fundamentally composed of sets of packets, named flows. The packets within a flow usually have dependencies among them, which reduce the amount of parallelism. However, packets of different flows tend to have very few or no dependencies among them, and thus can exploit thread-level parallelism to its fullest extent. Therefore, in massive tile-based multi-core architectures, it is important that the processing of the packets of a particular flow takes place in a set of cores physically close to each other to minimize the communication latency among those cores. Moreover, it is also desirable to spread out the processing of the different flows across all the cores of the processor in order to minimize the stress on a reduced number of cores, thus minimizing the potential for thermal hotspots and increasing the reliability of the processor. In addition, the burst-like nature of packet-based workloads render most of the cores idle most of the time, enabling large power savings by power gating these idle cores. This work presents a high-level study of the performance, power, and thermal behavior of tile-based architectures with a large number of cores executing flow-based packet workloads, and proposes a load-balancing policy of assigning packets to cores that minimizes the communication latency while featuring a hotspot-free thermal profile.

Publisher

Association for Computing Machinery (ACM)

Subject

Hardware and Architecture,Software

Link

https://dl.acm.org/doi/pdf/10.1145/1698772.1698782

Reference42 articles.

1. Leakage Power Analysis and Reduction for Nanoscale Circuits

2. Integration Challenges and Tradeoffs for Terascale Architectures

3. Design tradeoffs for tiled CMP on-chip networks

Cited by 3 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. References;Modeling and Optimization of Parallel and Distributed Embedded Systems;2016-01-08

2. High-performance optimizations on tiled many-core embedded systems: a matrix multiplication case study;The Journal of Supercomputing;2013-04-05

3. Variable-size mosaics: A process-variation aware technique to increase the performance of tile-based, massive multi-core processors;Computers & Electrical Engineering;2011-11