MODELING OF CORRELATED FAILURES AND COMMUNITY ERROR RECOVERY IN MULTIVERSION SOFTWARE

被引:48
作者
NICOLA, VF
GOYAL, A
机构
[1] IBM Thomas J. Watson Research Center, Yorktown Heights, NY 10598
关键词
Combinatorial methods; correlated failures; error recovery; fault-tolerance; multiversion programming; software reliability;
D O I
10.1109/32.48942
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
In this paper we consider three aspects in modeling of multiversion software. First, we propose the Beta-Binomial distribution to model correlated failures in multiversion software. Second, we present a combinatorial model to predict the reliability of a multiversion software configuration. This model can take as inputs failure distributions either from measurements or from a selected distribution (e.g., Beta-Binomial). Various recovery methods can be incorporated in this model. Third, we investigate the effectiveness of the Community Error Recovery method based on checkpointing as suggested in [13]. This method appears to be effective only when the failure behavior of program versions are lightly correlated. We also consider two different types of checkpoint failures: an omission failure where the correct output is recognized at a checkpoint but the checkpoint fails to correct the wrong outputs, and a destructive failure where the good versions get corrupted at a checkpoint. The former just reduces the effectiveness of the checkpoints while the latter has a catastrophic effect on the reliability. © 1990 IEEE
引用
收藏
页码:350 / 359
页数:10
相关论文
共 13 条
[1]   THE N-VERSION APPROACH TO FAULT-TOLERANT SOFTWARE [J].
AVIZIENIS, A .
IEEE TRANSACTIONS ON SOFTWARE ENGINEERING, 1985, 11 (12) :1491-1501
[2]  
AVIZIENIS A, 1985, P FTC15 ANN ARB, P126
[3]   A THEORETICAL BASIS FOR THE ANALYSIS OF MULTIVERSION SOFTWARE SUBJECT TO COINCIDENT ERRORS [J].
ECKHARDT, DE ;
LEE, LD .
IEEE TRANSACTIONS ON SOFTWARE ENGINEERING, 1985, 11 (12) :1511-1517
[4]   MAXIMUM LIKELIHOOD ESTIMATION FOR BETA-BINOMIAL DISTRIBUTION AND AN APPLICATION TO HOUSEHOLD DISTRIBUTION OF TOTAL NUMBER OF CASES OF A DISEASE [J].
GRIFFITHS, DA .
BIOMETRICS, 1973, 29 (04) :637-648
[5]  
HOGG RV, 1987, INTRO MATH STATISTIC
[6]  
KELLY J, 1988, P FTCS, V18, P9
[7]  
Knight J.C., 1986, P FTCS, V16, P165
[8]   AN EXPERIMENTAL EVALUATION OF THE ASSUMPTION OF INDEPENDENCE IN MULTIVERSION PROGRAMMING [J].
KNIGHT, JC ;
LEVESON, NG .
IEEE TRANSACTIONS ON SOFTWARE ENGINEERING, 1986, 12 (01) :96-109
[9]  
KNIGHT JC, 1985, P FTCS, V15, P135
[10]  
LAPRIE JC, 1987, P FTC, V17, P116