Network survivability modeling

被引:117
作者
Heegaard, Poul E. [1 ]
Trivedi, Kishor S. [2 ]
机构
[1] Norwegian Univ Sci & Technol NTNU, Dept Telemat, N-7491 Trondheim, Norway
[2] Duke Univ, Pratt Sch Engn, Durham, NC USA
基金
美国国家科学基金会;
关键词
Survivability; End-to-end performance; Analytical models; Simulation; FRAMEWORK;
D O I
10.1016/j.comnet.2009.02.014
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Critical services in a telecommunication network should be continuously provided even when undesirable events like sabotage, natural disasters, or network failures happen. It is essential to provide virtual connections between peering nodes with certain performance guarantees such as minimum throughput, maximum delay or loss. The design, construction and management of virtual connections, network infrastructures and service platforms aim at meeting such requirements. In this paper we consider the network's ability to survive major and minor failures in network infrastructure and service platforms that are caused by undesired events that might be external or internal. Survive means that the services provided comply with the requirement also in presence of failures. The network survivability is quantified as defined by the ANSI T1A1.2 committee which is the transient performance from the instant an undesirable event occurs until steady state with an acceptable performance level is attained. The assessment of the survivability of a network with virtual connections exposed to link or node failures is addressed in this paper. We have developed both simulation and analytic models to cross-validate our assumptions. In order to avoid state space explosion while addressing large networks we decompose our models first in space by studying the nodes independently and then in time by decoupling our analytic performance and recovery models which gives us a closed form solution. The modeling approaches are applied to both small and real-sized network examples. Three different scenarios have been defined, including single link failure, hurricane disaster, and instabilities in a large block of the system (transient common failure). The results show very good correspondence between the transient loss and delay performance in our simulations and in the analytic approximations. (C) 2009 Elsevier B.V. All rights reserved.
引用
收藏
页码:1215 / 1234
页数:20
相关论文
共 52 条
[1]  
[Anonymous], 1997, CMUSEI97TR013
[2]  
[Anonymous], 2001, Probability and statistics with reliability, queueing, and computer science applications
[3]  
[Anonymous], NATL STRATEGY PHYS P
[4]  
ANSI T1A1.2 Working Group on Network Surviv-ability Performance, 2001, 68 ANSI TR
[5]  
AWDUCHE D, 1999, RFC2702 IETF
[6]   OPEN, CLOSED, AND MIXED NETWORKS OF QUEUES WITH DIFFERENT CLASSES OF CUSTOMERS [J].
BASKETT, F ;
CHANDY, KM ;
MUNTZ, RR ;
PALACIOS, FG .
JOURNAL OF THE ACM, 1975, 22 (02) :248-260
[7]  
Birtwistle G., 1997, DEMOS SYSTEM DISCRET
[8]  
BOBBIO A, 1986, IEEE T COMPUT, V35, P803, DOI 10.1109/TC.1986.1676840
[9]  
CALLON R, 1990, RFC1195 IETF
[10]  
CHEN DY, 2002, ACM INT WORKSH MOD A