A survey of checker architectures-Reference-Cited by-同舟云学术

A survey of checker architectures

Published:2013-08 Issue:4 Volume:45 Page:1-34
ISSN:0360-0300
Container-title:ACM Computing Surveys
language:en
Short-container-title:ACM Comput. Surv.

Author:

Kalayappan Rajshekar¹,Sarangi Smruti R.¹

Affiliation:

1. Indian Institute of Technology, New Delhi, India

Abstract

Reliability is quickly becoming a primary design constraint for high-end processors because of the inherent limits of manufacturability, extreme miniaturization of transistors, and the growing complexity of large multicore chips. To achieve a high degree of fault tolerance, we need to detect faults quickly and try to rectify them. In this article, we focus on the former aspect. We present a survey of different kinds of fault detection mechanisms for processors at circuit, architecture, and software level. We collectively refer to such mechanisms as checker architectures . First, we propose a novel two-level taxonomy for different classes of checkers based on their structure and functionality. Subsequently, for each class we present the ideas in some of the seminal papers that have defined the direction of the area along with important extensions published in later work.

Publisher

Association for Computing Machinery (ACM)

Subject

General Computer Science,Theoretical Computer Science

Link

https://dl.acm.org/doi/pdf/10.1145/2501654.2501662

Reference97 articles.

1. Shared memory consistency models: a tutorial

2. Design and evaluation of system-level checks for on-line control flow error detection

3. Necromancer

Cited by 10 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Survey on Redundancy Based-Fault tolerance methods for Processors and Hardware accelerators - Trends in Quantum Computing, Heterogeneous Systems and Reliability;ACM Computing Surveys;2024-06-28

2. A Formal Approach to Accountability in Heterogeneous Systems-on-Chip;IEEE Transactions on Dependable and Secure Computing;2021-11-01

3. Multi-core Devices for Safety-critical Systems;ACM Computing Surveys;2021-07-31

4. Binary Tree Classification of Rigid Error Detection and Correction Techniques;ACM Computing Surveys;2021-07-31

5. SeRA: Self-Repairing Architecture for Dark Silicon Era;Journal of Circuits, Systems and Computers;2019-06-13