An optimizing pipeline stall reduction algorithm for power and performance on multi-core CPUs

Published:2015-01-29 Issue:1 Volume:5 Page:
ISSN:2192-1962
Container-title:Human-centric Computing and Information Sciences
language:en
Short-container-title:Hum. Cent. Comput. Inf. Sci.

Author:

Saravanan Vijayalakshmi, Pralhaddas Kothari Dwarkadas, Kothari Dwarkadas Pralhaddas, Woungang Isaac

Abstract

The power-performance trade-off is one of the major considerations in micro-architecture design. Pipelined architecture, which is widely used in all modern processors, has brought a radical change in design by capitalizing on the parallel operation of the various functional blocks involved in instruction execution. Pipelining introduces instruction-level parallelism (ILP) through the potential overlap of instructions, but it has drawbacks in the form of hazards, which result from data dependencies and resource conflicts. To overcome these hazards, stalls were introduced, which delay the execution of instructions to defuse the problematic situation. Out-of-order (OOO) execution is a ramification of the stall approach, since it executes instructions in an order governed by the availability of their input data rather than by their original order in the program. This paper presents a new algorithm called Left-Right (LR) for reducing stalls in pipelined processors. The algorithm combines traditional in-order and out-of-order (OOO) instruction execution to obtain the best of both approaches. As instruction input, we take Tomasulo's algorithm for out-of-order scheduling and in-order instruction execution, and we compare the proposed algorithm's efficiency against both in terms of power-performance gain. Experimental simulations conducted using Sim-Panalyzer, an instruction-level simulator, show that the proposed algorithm optimizes power-performance, with an effective increase of 30% in energy consumption benefits compared to Tomasulo's algorithm and 3% compared to the in-order algorithm.

Publisher

Springer Science and Business Media LLC

Subject

General Computer Science

Link

https://link.springer.com/content/pdf/10.1186/s13673-014-0016-8.pdf

References (27 articles)

1. Kogge PM (1981) The Architecture of pipelined computers. McGraw-Hill advanced computer science series, Hemisphere, Washington, New York, Paris, Includes index.

2. Johnson WM (1989) Super-scalar processor design. Technical report.

3. Hartstein A, Puzak TR (2002) The optimum pipeline depth for a microprocessor. SIGARCH Comput Archit News 30(2): 7–13.

4. Shamshiri S, Esmaeilzadeh H, Navabi Z (2005) Instruction-level test methodology for cpu core self-testing. ACM Trans Des Autom Electron Syst 10(4): 673–689.

5. Patterson DA, Hennessy JL (2006) In praise of computer architecture: a quantitative approach. Number 704. Morgan Kaufmann.

Cited by 45 articles.

1. Application of Mobile Health in Mental Health Interventions;2023 Annual International Conference on Emerging Research Areas: International Conference on Intelligent Systems (AICERA/ICIS);2023-11-16

2. Research on recognition of students attention in offline classroom-based on deep learning;Education and Information Technologies;2023-08-09

3. A Study on the Construction of the Evaluation System of the Teaching Ability of Students using Pattern Recognition for Studying Majoring in Badminton in the Mixed Learning Model of Physical Education Majors and Self-Learning System;ACM Transactions on Asian and Low-Resource Language Information Processing;2022-11-12

4. Activation‐based recurrent learning method for wearable sensor data processing in human activity recognition;Transactions on Emerging Telecommunications Technologies;2022-11-11

5. Comparative Study on the Teaching of Japanese Listening and Speaking in the Language Room of Teachers' Intelligent Multimedia Systems in Chinese and Foreign Universities;ACM Transactions on Asian and Low-Resource Language Information Processing;2022-11-11