A fault-tolerant algorithm for decentralized on-line quorum adaptation

被引:5
作者
Bearden, M [1 ]
Bianchini, RP [1 ]
机构
[1] AT&T Bell Labs, Lucent Technol, Murray Hill, NJ 07974 USA
来源
TWENTY-EIGHTH ANNUAL INTERNATIONAL SYMPOSIUM ON FAULT-TOLERANT COMPUTING, DIGEST PAPERS | 1998年
关键词
D O I
10.1109/FTCS.1998.689477
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
A quorum-based distributed mutual exclusion protocol requires each processor in a distributed system to obtain permission from a quorum of processors before accessing a resource that cannot be concurrently shared. To prevent failed quorum members from blocking access to the resource, it is desirable to remove failed processors from quorums when failures are detected. This work addresses the problem of adapting quorums on-line, while a quorum-based mutual exclusion protocol continues to operate. To preserve the quorum intersection property that is required for mutual exclusion safety, it is necessary to coordinate changes made to the quorum data structures of different processors. A solution is given in the form of QADAPT, a decentralized algorithm that guarantees safe adaptation of quorums when processors fail. QADAPT enables any set of quorum adaptations that do not violate the quorum intersection property, and enables any set of faulty processors to be removed from quorums. QADAPT has optimal message passing cost and tolerates any number of processor (halting) failures. A distributed system model is assumed that provides only point-to-point messages with no message ordering. Results from an implementation show that the algorithm's execution rime scales well in a system containing up to fifty networked workstations. Extensions of this work include on-line adaptation of quorums that are used to maintain replica consistency in distributed databases.
引用
收藏
页码:262 / 271
页数:10
相关论文
empty
未找到相关数据