Edge-based mining of frequent subgraphs from graph streams

被引:15
作者
Cuzzocrea, Alfredo [1 ,2 ]
Han, Zhao [3 ]
Jiang, Fan [3 ]
Leung, Carson K. [3 ]
Zhang, Hao [3 ]
机构
[1] Univ Trieste, Dept Engn & Architecture DIA, I-34127 Trieste, TS, Italy
[2] ICAR CNR, I-34127 Trieste, TS, Italy
[3] Univ Manitoba, Dept Comp Sci, Winnipeg, MB R3T 2N2, Canada
来源
KNOWLEDGE-BASED AND INTELLIGENT INFORMATION & ENGINEERING SYSTEMS 19TH ANNUAL CONFERENCE, KES-2015 | 2015年 / 60卷
关键词
Knowledge discovery and data mining; frequent patterns; frequent subgraphs; graph structured data; data streams;
D O I
10.1016/j.procs.2015.08.184
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In the current era of Big data, high volumes of valuable data can be generated at a high velocity from high-varieties of data sources in various real-life applications ranging from sensor networks to social networks, from bio-informatics to chemical informatics. In addition, Big data are also available in business, education, engineering, finance, healthcare, scientific, telecommunication, and transportation domains. A collection of these data can be viewed as a big dynamic graph structure. Embedded in them are implicit, previously unknown, and potentially useful knowledge. Consequently, efficient knowledge discovery algorithms for mining frequent subgraphs from these dynamic streaming graph structured data are in demand. On the one hand, some existing algorithms discover collections of frequently co-occurring edges, which may be disjoint. On the other hand, some other existing algorithms discover frequent subgraphs by requiring very large memory space. With high volumes of Big data, available memory space may be limited. To discover collections of frequently co-occurring connected edges, we present in this paper two efficient algorithms that require small memory space. Evaluation results show the efficiency of our edge-based algorithms in mining frequent subgraphs from graph streams. (C) 2015 The Authors. Published by Elsevier B.V.
引用
收藏
页码:573 / 582
页数:10
相关论文
共 32 条
[1]   On Dense Pattern Mining in Graph Streams [J].
Aggarwal, Charu C. ;
Li, Yao ;
Yu, Philip S. ;
Jin, Ruoming .
PROCEEDINGS OF THE VLDB ENDOWMENT, 2010, 3 (01) :975-984
[2]  
Bifet Albert., 2011, P 17 ACM SIGKDD INT, P591, DOI DOI 10.1145/2020408.2020501
[3]   Effectively and efficiently mining frequent patterns from dense graph streams on disk [J].
Braun, Peter ;
Cameron, Juan J. ;
Cuzzocrea, Alfredo ;
Jiang, Fan ;
Leung, Carson K. .
KNOWLEDGE-BASED AND INTELLIGENT INFORMATION & ENGINEERING SYSTEMS 18TH ANNUAL CONFERENCE, KES-2014, 2014, 35 :338-347
[4]   A tree-based algorithm for mining diverse social entities [J].
Braun, Peter ;
Cuzzocrea, Alfredo ;
Leung, Carson K. ;
MacKinnon, Richard Kyle ;
Tanbeer, Syed K. .
KNOWLEDGE-BASED AND INTELLIGENT INFORMATION & ENGINEERING SYSTEMS 18TH ANNUAL CONFERENCE, KES-2014, 2014, 35 :223-232
[5]  
Cameron J.J., 2013, P 28 ANN ACM S APPL, P173
[6]  
Cuzzocrea A, 2004, LECT NOTES COMPUT SC, V3292, P144
[7]  
Cuzzocrea A., 2015, P EDBT ICDT WORKSH 2, P237
[8]   Event-based lossy compression for effective and efficient OLAP over data streams [J].
Cuzzocrea, Alfredo ;
Chakravarthy, Sharma .
DATA & KNOWLEDGE ENGINEERING, 2010, 69 (07) :678-708
[9]  
Cuzzocrea A, 2009, LECT NOTES COMPUT SC, V5691, P48, DOI 10.1007/978-3-642-03730-6_5
[10]   Ensemble classifier for mining data streams [J].
Czarnowski, Ireneusz ;
Jedrzejowicz, Piotr .
KNOWLEDGE-BASED AND INTELLIGENT INFORMATION & ENGINEERING SYSTEMS 18TH ANNUAL CONFERENCE, KES-2014, 2014, 35 :397-406