Early detection of network element outages based on customer trouble calls

被引:9
作者
Deljac, Zeljko [1 ]
Randic, Mirko [2 ]
Krcelic, Gordan [1 ]
机构
[1] T Hrvatski Telekom, Tech Funct, Savska 32, Zagreb, Croatia
[2] Univ Zagreb, Fac Elect Engn & Comp, Zagreb 41000, Croatia
关键词
Fault management; Broadband network; Early fault detection; Alarm system; Fault detection delay; INTRUSION DETECTION METHOD; ANOMALY DETECTION; FAULT-DETECTION; REDUCTION; SYSTEMS; ALARMS;
D O I
10.1016/j.dss.2015.02.014
中图分类号
TP18 [人工智能理论];
学科分类号
140502 [人工智能];
摘要
This paper deals with the issue of early detection of network element outages. Timeliness of outage detection as well as accuracy in finding outages on equipment in a telecommunication network depend on the monitoring system used and its performance. The intent of this paper is to investigate and propose a complementary solution to improve the performance of the existing systems in detecting faults earlier than it was able to do before. In developing our approach two constraints are given. The existing operational environment cannot be changed; threshold tuning and parameter changing cannot be done; furthermore no additional infrastructure investment has been planned. Hence, our approach relies on an alternative method based on a two-stage hybrid statistical and diagnostic detector which we designed in a way that exploits additional available data and avoids alarm monitoring system imperfections. The role of this detector is twofold: early detection of network element outages based on customer trouble calls and rule-based derision making for faulty-element isolation based on knowledge derived from fault and network management data. In this paper we present results of statistical analysis of trouble-reporting data. The analysis showed that the timing of customers' trouble reports and their content have information potential that can be utilized for early detection of outages. The detector is explained in detail and its accuracy and reduction delay is evaluated. The method presented can reduce the outage detection delay time by 233 h on average observed in relation to the performance of an existing fault management process which was designed to detect outages solely on the basis of an alarm monitoring system, for the "difficulties in work" type of malfunction. We attained an overall probability of correct detection of 95.3%. Out of the total number of outages that hypothetically could be detected, by using this method we were able to detect 77.5% of cases 1 h before the alarm was raised in the existing alarm system, while 23% of cases were detected 4 h before the actual alarm. The approach has been tested on real telecommunication network data over the period of one year. (C) 2015 Elsevier B.V. All rights reserved.
引用
收藏
页码:57 / 73
页数:17
相关论文
共 60 条
[1]
On expected detection delays for alarm systems with deadbands and delay-timers [J].
Adnan, Naseeb Ahmed ;
Izadi, Iman ;
Chen, Tongwen .
JOURNAL OF PROCESS CONTROL, 2011, 21 (09) :1318-1331
[2]
Hybrid Approach for Detection of Anomaly Network Traffic using Data Mining Techniques [J].
Agarwal, Basant ;
Mittal, Namita .
2ND INTERNATIONAL CONFERENCE ON COMMUNICATION, COMPUTING & SECURITY [ICCCS-2012], 2012, 1 :996-1003
[3]
Integration of techniques for early fault detection and diagnosis for improving process safety: Application to a Fluid Catalytic Cracking refinery process [J].
Agudelo, Carlos ;
Morant Anglada, Francisco ;
Quiles Cucarella, Eduardo ;
Garcia Moreno, Emilio .
JOURNAL OF LOSS PREVENTION IN THE PROCESS INDUSTRIES, 2013, 26 (04) :660-665
[4]
Intrusion detection alarms reduction using root cause analysis and clustering [J].
Al-Mamory, Safaa O. ;
Zhang, Hongli .
COMPUTER COMMUNICATIONS, 2009, 32 (02) :419-430
[5]
Amershi S, 2011, 29TH ANNUAL CHI CONFERENCE ON HUMAN FACTORS IN COMPUTING SYSTEMS, P157
[6]
[Anonymous], 2001, INT JOINT C ARTIFICI
[7]
Ashfaq A.B., 2009, J COMPUTER VIROLOGY, V7, P63
[8]
Barford P, 2002, IMW 2002: PROCEEDINGS OF THE SECOND INTERNET MEASUREMENT WORKSHOP, P71, DOI 10.1145/637201.637210
[9]
Bonab M.I., 2009, INT J COMPUTER SCI N, V9
[10]
Ensemble methods for anomaly detection and distributed intrusion detection in Mobile Ad-Hoc Networks [J].
Cabrera, Joao B. D. ;
Gutierrez, Carlos ;
Mehra, Raman K. .
INFORMATION FUSION, 2008, 9 (01) :96-119