Component based design of multitolerant systems

被引:49
作者
Arora, A [1 ]
Kulkarni, SS [1 ]
机构
[1] Ohio State Univ, Dept Comp & Informat Sci, Columbus, OH 43210 USA
基金
美国国家科学基金会;
关键词
formal methods; compositional design; interference-freedom; stepwise design; detectors; correctors; dependability; fault-tolerance; graceful degradation;
D O I
10.1109/32.663998
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
The concept of multitolerance abstracts problems in system dependability and provides a basis for improved design of dependable systems. In the abstraction, each source of undependability in the system is represented as a class of faults, and the corresponding ability of the system to deal with that undependability source is represented as a type of tolerance. Multitolerance thus refers to the ability of the system to tolerate multiple fault-classes, each in a possibly different way. In this paper, we present a component based method for designing multitolerance. Two types of components are employed by the method, namely detectors and correctors. A theory of detectors, correctors, and their interference-free composition with intolerant programs is developed, that enables stepwise addition of components to provide tolerance to a new fault-class while preserving the tolerances to the previously added fault-classes. We illustrate the method by designing a fully distributed multitolerant program for a token ring.
引用
收藏
页码:63 / 78
页数:16
相关论文
共 27 条
[1]   THE EXISTENCE OF REFINEMENT MAPPINGS [J].
ABADI, M ;
LAMPORT, L .
THEORETICAL COMPUTER SCIENCE, 1991, 82 (02) :253-284
[2]   DEFINING LIVENESS [J].
ALPERN, B ;
SCHNEIDER, FB .
INFORMATION PROCESSING LETTERS, 1985, 21 (04) :181-185
[3]  
Alpern B., 1987, Proceedings of the Symposium on Logic in Computer Science (Cat. No.87CH2464-6), P131
[4]  
[Anonymous], 1981, SCI PROGRAMMING, DOI DOI 10.1007/978-1-4612-5983-1
[5]  
Arora A, 1996, J HIGH SPEED NETW, V5, P293
[6]   CLOSURE AND CONVERGENCE - A FOUNDATION OF FAULT-TOLERANT COMPUTING [J].
ARORA, A ;
GOUDA, M .
IEEE TRANSACTIONS ON SOFTWARE ENGINEERING, 1993, 19 (11) :1015-1027
[7]  
ARORA A, 1995, 14TH SYMPOSIUM ON RELIABLE DISTRIBUTED SYSTEMS, PROCEEDINGS, P174, DOI 10.1109/RELDIS.1995.526225
[8]  
ARORA A, 1996, 0796 OSUCISRC
[9]  
ARORA A, 1992, THESIS U TEXAS AUSTI
[10]  
BACK RJR, 1994, ACM T SOFTW ENG METH, V3, P133