FAULT INJECTION EXPERIMENTS USING FIAT

被引:99
作者
BARTON, JH [1 ]
CZECK, EW [1 ]
SEGALL, ZZ [1 ]
SIEWIOREK, DP [1 ]
机构
[1] CARNEGIE MELLON UNIV,CTR DEPENDABLE SYST,DEPT ELECT & COMP ENGN,PITTSBURGH,PA 15208
关键词
Dependable systems; experimental evaluation; fault injection; system monitoring; system validation;
D O I
10.1109/12.54853
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
This paper presents the results of several experiments conducted using the FIAT (fault injection-based automated testing) system. FIAT is capable of emulating a variety of distributed system architectures, and provides the capabilities to monitor system behavior and inject faults for the purpose of experimental characterization and validation of a system's dependability. The experiments consisted of exhaustively injecting three separate fault types into various locations, encompassing both the code and data portions of memory images, of two distinct applications executed with several different data values and sizes. Fault types were variations of memory bit faults. The results have shown that there are a limited number of system-level fault manifestations. These manifestations follow a normal distribution for each fault type. Error detection latencies were also found to be normally distributed. Thus, sampling techniques can be used to cut down the size of the fault injection space without sacrificing fidelity. Finally, the methodology can be used to predict the system-level fault responses during the system design stage. As an example, a model is developed for the occurrence of illegal opcode traps, based on the instruction set architecture, a static analysis of the workload, and a model of the low-level faults. The prediction of system-level responses is shown to be within 10 percent of the experimental data. © 1990 IEEE
引用
收藏
页码:575 / 582
页数:8
相关论文
共 16 条
[1]  
ADRION WR, 1982, ACM COMPUT SURV, V14, P159, DOI DOI 10.1145/356876.356879
[2]  
ARLAT J, 1989, 19TH P INT S FAULT T, P348
[3]  
CHILLAREGE R, 1989, 19TH P INT S FAULT T, P355
[4]  
CZECK EW, 1989, NASA CR4244 CARN U
[5]   CHARACTERIZATION OF FAULT RECOVERY THROUGH FAULT INJECTION ON FTMP [J].
FINELLI, GB .
IEEE TRANSACTIONS ON RELIABILITY, 1987, 36 (02) :164-170
[6]  
LALA J, 1983, 5TH P AIAA IEEE DIG
[7]  
Laprie J.-C., 1985, Fifteenth Annual International Symposium on Fault-Tolerant Computing FTCS 15. Digest of Papers. (Cat. No. 85CH2143-6), P2
[8]  
LOMELINO D, 1986, NASA CR176501 U ILL
[9]  
MCGOUGH JG, 1983, AUG P IEEE EASCON C, P299
[10]  
Mendenhall W., 1979, INTRO PROBABILITY ST