Clustering and community detection in directed networks: A survey

被引:586
作者
Malliaros, Fragkiskos D. [1 ]
Vazirgiannis, Michalis [1 ,2 ]
机构
[1] Ecole Polytech, Comp Sci Lab, F-91120 Palaiseau, France
[2] Athens Univ Econ & Business, Dept Informat, Athens 10434, Greece
来源
PHYSICS REPORTS-REVIEW SECTION OF PHYSICS LETTERS | 2013年 / 533卷 / 04期
关键词
Community detection; Graph clustering; Directed networks; Complex networks; Graph mining; COMPLEX NETWORKS; DIGRAPH LAPLACIAN; RANDOM-WALKS; MODULARITY; IDENTIFICATION; MODELS;
D O I
10.1016/j.physrep.2013.08.002
中图分类号
O4 [物理学];
学科分类号
070305 [高分子化学与物理];
摘要
Networks (or graphs) appear as dominant structures in diverse domains, including sociology, biology, neuroscience and computer science. In most of the aforementioned cases graphs are directed - in the sense that there is directionality on the edges, making the semantics of the edges nonsymmetric as the source node transmits some property to the target one but not vice versa. An interesting feature that real networks present is the clustering or community structure property, under which the graph topology is organized into modules commonly called communities or clusters. The essence here is that nodes of the same community are highly similar while on the contrary, nodes across communities present low similarity. Revealing the underlying community structure of directed complex networks has become a crucial and interdisciplinary topic with a plethora of relevant application domains. Therefore, naturally there is a recent wealth of research production in the area of mining directed graphs - with clustering being the primary method sought and the primary tool for community detection and evaluation. The goal of this paper is to offer an in-depth comparative review of the methods presented so far for clustering directed networks along with the relevant necessary methodological background and also related applications. The survey commences by offering a concise review of the fundamental concepts and methodological base on which graph clustering algorithms capitalize on. Then we present the relevant work along two orthogonal classifications. The first one is mostly concerned with the methodological principles of the clustering algorithms, while the second one approaches the methods from the viewpoint regarding the properties of a good cluster in a directed network. Further, we present methods and metrics for evaluating graph clustering results, demonstrate interesting application domains and provide promising future research directions. (C) 2013 Elsevier B.V. All rights reserved.
引用
收藏
页码:95 / 142
页数:48
相关论文
共 154 条
[1]
Aggarwal C. C., 2010, SURVEY CLUSTERING AL
[2]
Airoldi EM, 2008, J MACH LEARN RES, V9, P1981
[3]
Andersen R, 2007, LECT NOTES COMPUT SC, V4863, P166
[4]
[Anonymous], 2005, ICML '05, DOI [10.1145/1102351.1102482, DOI 10.1145/1102351.1102482]
[5]
[Anonymous], 2005, Generalized Blockmodeling
[6]
[Anonymous], LARGE SCALE STRUCTUR
[7]
[Anonymous], MULTIWAY CUTS SPECTR
[8]
[Anonymous], COMMUNITY DISCOVERY
[9]
[Anonymous], PHYS REV E 2
[10]
[Anonymous], 2 INT WORKSH COMB SC