Adaptive Anomaly Detection in Cloud using Robust and Scalable Principal Component Analysis

被引:16
作者
Agrawal, Bikash [1 ]
Wiktorski, Tomasz [1 ]
Rong, Chunming [1 ]
机构
[1] Univ Stavanger, Dept Comp & Elect Engn, Stavanger, Norway
来源
2016 15TH INTERNATIONAL SYMPOSIUM ON PARALLEL AND DISTRIBUTED COMPUTING (ISPDC) | 2016年
关键词
Anomaly detection; Outlier detection; Robust PCA; Spark; Hadoop; Data center; performance analysis; SVD;
D O I
10.1109/ISPDC.2016.22
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
080201 [机械制造及其自动化];
摘要
This paper proposes a novel and scalable model for automatic anomaly detection on a large system such as a cloud. Anomaly detection issues early warning of unusual behavior in dynamic environments by learning system characteristic from normal operational data. Anomaly detection in large systems is difficult to detect due heterogeneity, dynamicity, scalability, hidden complexity, and time limitation. To detect anomalous activity in the cloud, we need to monitor the datacenter and collect cloud performance data. In this paper, we propose an adaptive anomaly detection mechanism which investigates principal components of performance metrics. It transforms the performance metrics into a low-rank matrix and then calculates the orthogonal distance using the Robust PCA algorithm. The proposed model updates itself recursively learning and adjusting the new threshold value in order to minimize reconstruction errors. This paper also investigates the robust principal component analysis in distributed environments using Apache Spark as the underlying framework, specifically addressing cases in which a normal operation might exhibit multiple hidden modes. The accuracy and sensitivity of the model is tested on Google data center traces and Yahoo! datasets. The model achieves an 87.24% accuracy.
引用
收藏
页码:100 / 106
页数:7
相关论文
共 29 条
[1]
[Anonymous], 2013, P 2013 WORKSH DATA M
[2]
[Anonymous], 2011, Tech. Rep.
[3]
[Anonymous], 2003, TECH REP
[4]
[Anonymous], ALG TRENDSP MEAN INT
[5]
[Anonymous], P OSDI
[6]
[Anonymous], 2011, P 2011 INT C INF SCI, DOI DOI 10.1109/ICISA.2011.5772437
[7]
[Anonymous], INT CMG C
[8]
[Anonymous], JAIDM
[9]
Robust Principal Component Analysis? [J].
Candes, Emmanuel J. ;
Li, Xiaodong ;
Ma, Yi ;
Wright, John .
JOURNAL OF THE ACM, 2011, 58 (03)
[10]
Celik M., 2011, 2011 International Symposium on Innovations in Intelligent Systems and Applications (INISTA 2011), P91, DOI 10.1109/INISTA.2011.5946052