Analytic modeling of network processors for parallel workload mapping

Published:2009-04 Issue:3 Volume:8 Page:1-29
ISSN:1539-9087
Container-title:ACM Transactions on Embedded Computing Systems
language:en
Short-container-title:ACM Trans. Embed. Comput. Syst.

Author:

Weng Ning¹, Wolf Tilman²

Affiliation:

1. Southern Illinois University Carbondale, Carbondale, IL

2. University of Massachusetts Amherst, Amherst, MA

Abstract

Network processors are heterogeneous system-on-chip multiprocessors that are optimized to perform packet forwarding and processing tasks at Gigabit data rates. To meet the performance demands of increasing link speeds and complex network applications, network processors are implemented with several dozen embedded processor cores and hardware accelerators that run multiple packet processing applications in parallel. The parallel nature of the processing system makes it increasingly difficult for application developers to understand and manage resources and map processing tasks to the hardware. To address this problem, we present a methodology for profiling and analyzing network processor applications, mapping processing tasks to a generalized network processor architecture, and analytically determining the expected throughput performance. The key novelty of this work is not only the adaptation of application analysis and mapping algorithms to heterogeneous network processors, but also that the entire process can be automated and hidden from the application developer. Starting with the analysis of a uniprocessor implementation of the application, the process yields a mapping of the partitioned application that shows best performance for a given network processor system. The simplicity of the proposed randomized mapping algorithm allows the use of this methodology in network processor runtime systems where dynamic reallocation of tasks is necessary but processing power is limited. We present results that show the effectiveness of the analysis and mapping methodology as well as its application to design space exploration.
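The abstract does not spell out the randomized mapping algorithm, so the following is only a minimal sketch of the general idea under simplifying assumptions, not the authors' method. It assumes identical cores, independent tasks with a known per-packet cost (as a profiling step might provide), and a throughput limited solely by the most heavily loaded core; task dependencies, communication, memory contention, and hardware accelerators are ignored. The names `estimate_throughput`, `random_mapping_search`, and `task_cost` are hypothetical.

```python
import random


def estimate_throughput(mapping, task_cost, num_processors):
    """Estimate packets per time unit for a task-to-processor mapping.

    Assumption: every packet visits every task, so the most heavily
    loaded processor is the bottleneck that bounds throughput.
    """
    load = [0.0] * num_processors
    for task, proc in mapping.items():
        load[proc] += task_cost[task]
    return 1.0 / max(load)


def random_mapping_search(task_cost, num_processors, trials=1000, seed=0):
    """Try random mappings and keep the one with the best estimate."""
    rng = random.Random(seed)  # fixed seed keeps the search reproducible
    best_mapping, best_throughput = None, 0.0
    for _ in range(trials):
        # Assign each task to a uniformly chosen processor.
        mapping = {task: rng.randrange(num_processors) for task in task_cost}
        throughput = estimate_throughput(mapping, task_cost, num_processors)
        if throughput > best_throughput:
            best_mapping, best_throughput = mapping, throughput
    return best_mapping, best_throughput
```

Calling `random_mapping_search({"parse": 4, "lookup": 3, "classify": 2, "forward": 1}, num_processors=2)` returns the best mapping found together with its estimated throughput. Because each trial is a single pass over the tasks, a search of this kind is cheap enough to rerun when the workload changes, which is the property the abstract highlights for runtime systems with limited processing power.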

Publisher

Association for Computing Machinery (ACM)

Subject

Hardware and Architecture, Software

Link

https://dl.acm.org/doi/pdf/10.1145/1509288.1509290

References: 35 articles.

1. Performance tradeoffs in multithreaded processors

2. Austin T. M. and Sohi G. S. 1993. Tetra: evaluation of serial program performance on fine-grain parallel processors. Tech. rep. 1163 Computer Science Department University of Wisconsin Madison.

3. Baker F. 1995. Requirements for IP version 4 routers. RFC 1812 Network Working Group.

4. Analysis of Memory Interference in Multiprocessors

5. Daemen J. and Rijmen V. 2000. The block cipher Rijndael. Lecture Notes in Computer Science. Vol. 1820. Springer-Verlag Berlin Germany 288--296.

Cited by 6 articles.

1. Culture-specific conceptualisations relating to corruption in China English;Lingua;2020-10

2. Research on Packet-Processing Architecture Based on Multi-core Processor;2014 Sixth International Conference on Measuring Technology and Mechatronics Automation;2014-01

3. MAPS: Mapping Concurrent Dataflow Applications to Heterogeneous MPSoCs;IEEE Transactions on Industrial Informatics;2013-02

4. Analytical Performance Models for MapReduce Workloads;International Journal of Parallel Programming;2012-11-27

5. Detection and Mitigation of High-Rate Flooding Attacks;An Investigation into the Detection and Mitigation of Denial of Service (DoS) Attacks;2011