Finding maximal homogeneous clique sets

被引:5
作者
Mougel, Pierre-Nicolas [1 ]
Rigotti, Christophe [1 ]
Plantevit, Marc [2 ]
Gandrillon, Olivier [3 ]
机构
[1] Univ Lyon, LIRIS, INSA Lyon, INRIA,CNRS,UMR5205, F-69621 Lyon, France
[2] Univ Lyon 1, LIRIS, CNRS, UMR5205, F-69622 Lyon, France
[3] Univ Lyon 1, Ctr Genet & Physiol Mol & Cellulaire, INRIA, CGPhiMC,CNRS,UMR5534, F-69622 Lyon, France
关键词
Graph mining; Interaction network; Attributed graph; Clique set; Homogeneous cliques; Scientific collaborations; Protein interactions; Gene expression; FREQUENT PATTERN; GRAPH;
D O I
10.1007/s10115-013-0625-y
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Many datasets can be encoded as graphs with sets of labels associated with the vertices. We consider this kind of graphs and we propose to look for patterns called maximal homogeneous clique sets, where such a pattern is a subgraph that is structured in several large cliques and where all vertices share enough labels. We present an algorithm based on graph enumeration to compute all patterns satisfying user-defined constraints on the number of separated cliques, on the size of these cliques, and on the number of labels shared by all the vertices. Our approach is tested on real datasets based on a social network of scientific collaborations and on a biological network of protein-protein interactions. The experiments show that the patterns are useful to exhibit subgraphs organized in several core modules of interactions. Performances are reported on real data and also on synthetic ones, showing that the approach can be applied on different kinds of large datasets.
引用
收藏
页码:579 / 608
页数:30
相关论文
共 38 条
[31]  
Nguyen K., 2011, P 11 SIAM INT C DAT, P570
[32]   Mining Attribute-structure Correlated Patterns in Large Attributed Graphs [J].
Silva, Arlei ;
Meira, Wagner, Jr. ;
Zaki, Mohammed J. .
PROCEEDINGS OF THE VLDB ENDOWMENT, 2012, 5 (05) :466-477
[33]   The worst-case time complexity for generating all maximal cliques and computational experiments [J].
Tomita, Etsuji ;
Tanaka, Akira ;
Takahashi, Haruhisa .
THEORETICAL COMPUTER SCIENCE, 2006, 363 (01) :28-42
[34]   Identification of functional modules using network topology and high-throughput data [J].
Ulitsky, Igor ;
Shamir, Ron .
BMC SYSTEMS BIOLOGY, 2007, 1
[35]  
Xu Z., 2012, P 2012 ACM SIGMOD IN, P505
[36]  
Yan XF, 2002, 2002 IEEE INTERNATIONAL CONFERENCE ON DATA MINING, PROCEEDINGS, P721, DOI 10.1109/ICDM.2002.1184038
[37]  
Yang ZH, 2010, PROCEEDINGS OF INTERNATIONAL CONFERENCE ON RESOURCE ENVIRONMENT AND INFORMATION TECHNOLOGY IN 2010 (REIT' 2010), P689, DOI 10.1109/ICDM.2010.41
[38]  
Zhou Y, 2009, PROCEEDINGS OF THE VLDB ENDOWMENT, V2