Optimal cost-effective design of parallel systems subject to imperfect fault-coverage

被引：9

作者：

Amari, S ^{[1
]}

McLaughlin, L ^{[1
]}

Yadlapati, B ^{[1
]}

机构：

[1] Relex Software Corp, Greensburg, PA 15601 USA

来源：

ANNUAL RELIABILITY AND MAINTAINABILITY SYMPOSIUM, 2003 PROCEEDINGS | 2003年

关键词：

imperfect fault-coverage; cost-effective design; optimal redundancy; fault-tolerant system; reliability maximization; mean time to failure;

D O I：

10.1109/RAMS.2003.1181898

中图分类号：

T [工业技术];

学科分类号：

08 ;

摘要：

Computer-based systems intended for critical applications are usually designed with sufficient redundancy to be tolerant of errors that may occur. However, under imperfect fault-coverage conditions (such as the system cannot adequately detect, locate, and recover from faults and errors in the system), system failures can result even when adequate redundancy is in place. Because parallel architecture is a well-known and powerful architecture for improving the reliability of fault-tolerant systems, this paper presents the cost-effective design policies of parallel systems subject to imperfect fault-coverage. The policies are designed by considering (1) cost of components, (2) failure cost of the system, (3) common-cause failures, and (4) performance levels of the system. Three kinds of cost functions are formulated considering that the total average cost of the system is based on: (1) system unreliability, (2) failure-time of the system, and (3) total processor-hours. It is shown that the MTTF (Mean Time To Failure) of the system decreases by increasing the spares beyond a certain limit. Therefore, this paper also presents optimal design policies to maximize the MTTF of these systems. The results of this paper-can also be applied to gracefully degradable systems.

引用

页码：29 / 34

页数：6

共 5 条

[1] Optimal reliability of systems subject to imperfect fault-coverage [J].