Affiliation:
1. Carnegie Mellon University, Pittsburgh, PA
Abstract
Recent studies have suggested that the soft-error rate in microprocessor logic will become a reliability concern by 2010. This paper proposes an efficient error detection technique, called
fingerprinting
, that detects differences in execution across a dual modular redundant (DMR) processor pair. Fingerprinting summarizes a processor's execution history in a hash-based signature; differences between two mirrored processors are exposed by comparing their fingerprints. Fingerprinting tightly bounds detection latency and greatly reduces the interprocessor communication bandwidth required for checking. This paper presents a study that evaluates fingerprinting against a range of current approaches to error detection. The result of this study shows that fingerprinting is the only error detection mechanism that simultaneously allows high-error coverage, low error detection bandwidth, and high I/O performance.
Publisher
Association for Computing Machinery (ACM)
Subject
Computer Graphics and Computer-Aided Design,Software
Cited by
24 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献
1. SoftFusion: A Low-Cost Approach to Enhance Reliability of Object Detection Applications;2022 IEEE 40th International Conference on Computer Design (ICCD);2022-10
2. EXPERTISE: An Effective Software-level Redundant Multithreading Scheme against Hardware Faults;ACM Transactions on Architecture and Code Optimization;2022-09-16
3. Studying error propagation on application data structure and hardware;The Journal of Supercomputing;2022-06-13
4. Reliability-Aware Runahead;2022 IEEE International Symposium on High-Performance Computer Architecture (HPCA);2022-04
5. Hybrid Quick Error Detection: Validation and Debug of SoCs Through High-Level Synthesis;IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems;2019-07