What's new: Finding significant differences in network data streams

被引:83
作者
Cormode, G
Muthukrishnan, S
机构
[1] Rutgers State Univ, Ctr Discrete Math & Comp Sci DIMAC, Piscataway, NJ 08845 USA
[2] Rutgers State Univ, Div Comp & Informat Syst, Piscataway, NJ 08854 USA
关键词
change detection; data streams; deltoids; network data analysis;
D O I
10.1109/TNET.2005.860096
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 [计算机科学与技术];
摘要
Monitoring and analyzing network traffic usage patterns is vital for managing IP Networks. An important problem is to provide network managers with information about changes in traffic, informing them about "what's new." Specifically, we focus on the challenge of finding significantly large differences in traffic: over time, between interfaces and between routers. We introduce the idea of a deltoid: an item that has a large difference, whether the difference is absolute, relative or variational. We present novel algorithms for finding the most significant deltoids in high-speed traffic data, and prove that they use small space, very small time per update, and are guaranteed to find significant deltoids with pre-specified accuracy. In experimental evaluation with real network traffic, our algorithms perform well and recover almost all deltoids. This is the first work to provide solutions capable of working over the data with one pass, at network traffic speeds.
引用
收藏
页码:1219 / 1232
页数:14
相关论文
共 31 条
[1]
Alon N., 1999, Proceedings of the Eighteenth ACM SIGMOD-SIGACT-SIGART Symposium on Principles of Database Systems, P10, DOI 10.1145/303976.303978
[2]
Alon N., 1996, Proceedings of the Twenty-Eighth Annual ACM Symposium on the Theory of Computing, P20, DOI 10.1145/237814.237823
[3]
Babcock B., 2002, Proceedings of the Twenty-First ACM SIGMOD-SIGACT-SIGART Symposium on Principles of Database Systems (PODS), P1, DOI DOI 10.1145/543613.543615
[4]
Charikar M., 2002, P 29 INT C AUT LANG, P693, DOI 10.1007/3-540-45465-9_59
[5]
An improved data stream summary: the count-min sketch and its applications [J].
Cormode, G ;
Muthukrishnan, S .
JOURNAL OF ALGORITHMS-COGNITION INFORMATICS AND LOGIC, 2005, 55 (01) :58-75
[6]
Cormode G, 2003, LECT NOTES COMPUT SC, V2832, P148
[7]
Cormode G., 2002, Proceedings of the Twenty-eighth International Conference on Very Large Data Bases, P335
[8]
Cormode G., 2003, ACM Transactions on Database Systems (TODS), P296, DOI DOI 10.1145/1061318.1061325
[9]
Cormode Graham, 2003, P VLDB, P464
[10]
Datar M, 2002, LECT NOTES COMPUT SC, V2461, P323