Steward: Scaling Byzantine Fault-Tolerant Replication to Wide Area Networks

被引:52
作者
Amir, Yair [1 ]
Danilov, Claudiu [2 ]
Dolev, Danny [3 ]
Kirsch, Jonathan [1 ]
Lane, John [1 ]
Nita-Rotaru, Cristina [4 ]
Olsen, Josh [5 ]
Zage, David [4 ]
机构
[1] Johns Hopkins Univ, Dept Comp Sci, Baltimore, MD 21218 USA
[2] Boeing Phantom Works, Seattle, WA 98124 USA
[3] Hebrew Univ Jerusalem, Sch Engn & Comp Sci, IL-91904 Jerusalem, Israel
[4] Purdue Univ, Dept Comp Sci, W Lafayette, IN 47907 USA
[5] Univ Calif Irvine, Donald Bren Sch Informat & Comp Sci, Irvine, CA 92697 USA
基金
美国国家科学基金会;
关键词
Fault tolerance; scalability; wide area networks; TIME;
D O I
10.1109/TDSC.2008.53
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
This paper presents the first hierarchical Byzantine fault-tolerant replication architecture suitable to systems that span multiple wide-area sites. The architecture confines the effects of any malicious replica to its local site, reduces message complexity of wide-area communication, and allows read-only queries to be performed locally within a site for the price of additional standard hardware. We present proofs that our algorithm provides safety and liveness properties. A prototype implementation is evaluated over several network topologies and is compared with a flat Byzantine fault-tolerant approach. The experimental results show considerable improvement over flat Byzantine replication algorithms, bringing the performance of Byzantine replication closer to existing benign fault-tolerant replication techniques over wide area networks.
引用
收藏
页码:80 / 93
页数:14
相关论文
共 40 条
[1]  
Abd-El-Malek M, 2005, USENIX Association Proceedings of the 4th Usenix Conference on File and Storage Technologies, P59
[2]   Reliable communication in overlay networks [J].
Amir, Y ;
Danilov, C .
2003 INTERNATIONAL CONFERENCE ON DEPENDABLE SYSTEMS AND NETWORKS, PROCEEDINGS, 2003, :511-520
[3]  
AMIR Y, 2003, CNDS20033
[4]   Customizable fault tolerance for wide-area replication [J].
Amir, Yair ;
Coan, Brian ;
Kirsch, Jonathan ;
Lane, John .
SRDS 2007: 26TH IEEE INTERNATIONAL SYMPOSIUM ON RELIABLE DISTRIBUTED SYSTEMS, PROCEEDINGS, 2007, :66-+
[5]   Scaling Byzantine fault-tolerant replication to wide area networks [J].
Amir, Yair ;
Danilov, Claudiu ;
Dolev, Danny ;
Kirsch, Jonathan ;
Lane, John ;
Nita-Rotaru, Cristina ;
Olsen, Josh ;
Zage, David .
DSN 2006 INTERNATIONAL CONFERENCE ON DEPENDABLE SYSTEMS AND NETWORKS, PROCEEDINGS, 2006, :105-114
[6]  
[Anonymous], 1983, Foundations of Computation Theory, volume 158 of Lecture Notes in Computer Science
[7]  
[Anonymous], ACM SIGACT NEWS
[8]   THE N-VERSION APPROACH TO FAULT-TOLERANT SOFTWARE [J].
AVIZIENIS, A .
IEEE TRANSACTIONS ON SOFTWARE ENGINEERING, 1985, 11 (12) :1491-1501
[9]  
Bernstein P.A., 1987, Concurrency Control and Recovery in Database Systems
[10]   ARGO upper salinity measurements: Perspectives for L-band radiometers calibration and retrieved sea surface salinity validation [J].
Boutin, J ;
Martin, N .
IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2006, 3 (02) :202-206