On Dense Pattern Mining in Graph Streams

被引:31
作者
Aggarwal, Charu C. [1 ]
Li, Yao [2 ]
Yu, Philip S. [2 ]
Jin, Ruoming [3 ]
机构
[1] IBM TJ Watson Res Ctr, Hawthorne, NY 10562 USA
[2] Univ Illinois, Chicago, IL USA
[3] Kent State Univ, Kent, OH 44242 USA
来源
PROCEEDINGS OF THE VLDB ENDOWMENT | 2010年 / 3卷 / 01期
关键词
D O I
10.14778/1920841.1920964
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Many massive web and communication network applications create data which can be represented as a massive sequential stream of edges. For example, conversations in a telecommunication network or messages in a social network can be represented as a massive stream of edges. Such streams are typically very large, because of the large amount of underlying activity in such networks. An important application in these domains is to determine frequently occurring dense structures in the underlying graph stream. In general, we would like to determine frequent and dense patterns in the underlying interactions. We introduce a model for dense pattern mining and propose probabilistic algorithms for determining such structural patterns effectively and efficiently. The purpose of the probabilistic approach is to create a summarization of the graph stream, which can be used for further pattern mining. We show that this summarization approach leads to effective and efficient results for stream pattern mining over a number of real and synthetic data sets.
引用
收藏
页码:975 / 984
页数:10
相关论文
共 13 条
[1]  
Abello J., 2002, LATIN C
[2]  
Aggarwal CC, 2010, ADV DATABASE SYST, V40, P1, DOI 10.1007/978-1-4419-6045-0
[3]  
Chi Y., 2004, ICDM C
[4]  
Cohen E., 2001, IEEE TKDE, V13
[5]  
Gibson D., 2005, VLDM C
[6]  
He H., 2007, ICDM C
[7]  
Jin R., 2005, ICDM C
[8]  
Navlakha S., 2008, SIGMOD
[9]  
Pei J., 2005, KDD
[10]  
Ranu S., 2009, ICDM C