Author:
Albini Luiz Carlos Pessoa,Duarte Elias Procópio,Ziwich Roverli Pereira
Abstract
Abstract
This work introduces a new system-level diagnosis model and an algorithm based on this model: Hi-Comp (Hierarchical Comparison-based Adaptive Distributed System-Level Diagnosis algorithm). This algorithm allows the diagnosis of systems that can be represented by a complete graph. Hi-Comp is the first diagnosis algorithm that is, at the same time, hierarchical, distributed and comparison-based. The algorithm is not limited to crash fault diagnosis, because its tests are based on comparisons. To perform a test, a processor sends a task to two processors of the system that, after executing the task, send their outputs back to the tester. The tester compares the two outputs; if the comparison produces a match, the tester considers the tested processors fault-free; on the other hand, if the comparison produces a mismatch, the tester considers that at least one of the two tested processors is faulty, but can not determine which one. Considering a system of N nodes, it is proved that the algorithm’s diagnosability is (N-1) and the latency is log2N testing rounds. Furthermore, a formal proof of the maximum number of tests required per testing round is presented, which can be O(N3). Simulation results are also presented.
Publisher
Springer Science and Business Media LLC
Reference26 articles.
1. A. Subbiah, and D.M. Blough, “Distributed Diagnosis in Dynamic Fault Environments,”IEEE Transactions on Paralel and Distributed Systems, Vol. 15 No. 5, pp. 453–467, 2004.
2. G. Masson, D. Blough, and G. Sullivan, “System Diagnosis,”Fault-Tolerant Computer System Design, ed. D.K. Pradhan, Prentice-Hall, 1996.
3. F. Preparata, G. Metze, and R.T. Chien, “On The Connection Assignment Problem of Diagnosable Systems,”IEEE Transactions on Electronic Computers, Vol. 16, pp. 848–854, 1968.
4. S.L. Hakimi, and A.T. Amin, “Characterization of Connection Assignments of Diagnosable Systems,”IEEE Transactions on Computers, Vol. 23, pp. 86–88, 1974.
5. S.L. Hakimi, and K. Nakajima, “On Adaptive System Diagnosis,”IEEE Transactions on Computers, Vol. 33, pp. 234–240, 1984.
Cited by
2 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献