A HYBRID MONITOR FOR BEHAVIOR AND PERFORMANCE ANALYSIS OF DISTRIBUTED SYSTEMS

被引:41
作者
HABAN, D [1 ]
WYBRANIETZ, D [1 ]
机构
[1] UNIV KAISERSLAUTERN,DEPT COMP SCI,W-6750 KAISERSLAUTERN,GERMANY
关键词
Data analysis; Distributed system; Evaluation; Graphical display; Hybrid monitor; Instrumentation; Measuring;
D O I
10.1109/32.44382
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
This paper describes a hybrid monitor for measuring the performance and observing the behavior of distributed systems during execution. Major problems are the interference of the monitoring system with the system to be monitored and the presentation of meaningful data to users. Monitoring becomes even more difficult in distributed systems since these systems feature asynchronous concurrent activities, nondeterministic behavior and lack central control, precise global time and accurate global state. We emphasize the data collection, analysis and presentation of execution data. A special hardware support, which consists of a test and measurement processor (TMP), was designed and has been implemented in the nodes of an experimental multicomputer system consisting of eleven nodes. The operations of the TMP are completely transparent with a minimal, less than 0.1 percent, overhead to the measured system. In the experimental system, all the TMP's were connected with a central monitoring station, using an independent communication network, in order to provide a global view of the monitored system. The central monitoring station displays the resulting information in easy-to-read charts and graphs. Our experience with the TMP shows that it promotes an improved understanding of run-time behavior and performance measurements, to derive qualitative and quantitative assessments of distributed systems. © 1990 IEEE
引用
收藏
页码:197 / 211
页数:15
相关论文
共 29 条
[1]  
BATES B, 1983, J SYST SOFTWARE MAR, P225
[2]  
BROWN C, 1985, COMPUT SCI COMPUT EN
[3]  
FERRARI D, 1981, EXPT COMPUTER PERFOR
[4]   EVALUATING 2 MASSIVELY PARALLEL MACHINES [J].
FRENKEL, KA .
COMMUNICATIONS OF THE ACM, 1986, 29 (08) :752-758
[5]  
FROMM H, 1983, IEEE T COMPUT, V32, P15, DOI 10.1109/TC.1983.1676120
[6]   DEBUGGING A DISTRIBUTED COMPUTING SYSTEM [J].
GARCIAMOLINA, H ;
GERMANO, F ;
KOHLER, WH .
IEEE TRANSACTIONS ON SOFTWARE ENGINEERING, 1984, 10 (02) :210-219
[7]  
GUESELLA R, 1989, IEEE T SOFTWARE ENG, V15, P847
[8]  
HABAN D, IN PRESS SPRINGER LN
[9]  
HABAN D, 1987, 6TH P S REL DISTR SO, P66
[10]  
HABAN D, 1989, IN PRESS 10TH P REAL