A Case Study of Ethernet Anomalies in a Distributed Computing Environment

Cited by: 39
Authors
MAXION, RA [1]
FEATHER, FE [1]
Affiliations
[1] CARNEGIE MELLON UNIV, DEPT ELECT & COMP ENGN, PITTSBURGH, PA 15213
Keywords
Anomaly detection; Diagnosis; Network faults
DOI
10.1109/24.58721
Chinese Library Classification number
TP3 [Computing technology, computer technology]
Discipline classification code
0812
Abstract
Distributed computing systems, or networks, are notoriously difficult environments in which to detect and diagnose faults. Fault detection and diagnosis depend critically on good fault definitions, but the dynamic, noisy and nonstationary character of networks makes it hard to define what a fault is in a network environment. The work presented in this paper takes the position that a fault or failure is a violation of expectations. In accordance with empirically based expectations, operating behaviors of networks (and other devices) can be classed as being either normal or anomalous. Because network failures most frequently manifest themselves as performance degradations, or deviations from expected behavior, periods of anomalous performance can be attributed to causes assignable as network faults. The half-year case study presented here employed a system in which observations of distributed computing network behavior were automatically and systematically grouped into two classes: normal and anomalous. Anomalous behaviors were traced to faulty conditions. In a preliminary effort to understand and catalog how networks behave under various conditions, two cases of anomalous behavior are analyzed in detail. Examples are taken from the distributed file system network at Carnegie Mellon University. © 1990 IEEE
Pages: 433-443
Page count: 11