SCAN: A Structural Clustering Algorithm for Netwo

被引:556
作者
Xu, Xiaowei [1 ]
Yuruk, Nurcan [1 ]
Feng, Zhidan [1 ]
Schweiger, Thomas A. J. [2 ]
机构
[1] Univ Arkansas, Little Rock, AR 72204 USA
[2] Acxiom Corp, Little Rock, AR USA
来源
KDD-2007 PROCEEDINGS OF THE THIRTEENTH ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING | 2007年
关键词
Network clustering; Graph partitioning; Community Structure; Hubs; Outliers;
D O I
10.1145/1281192.1281280
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Network clustering (or graph partitioning) is an important task for the discovery of underlying structures in networks. Many algorithms find clusters by maximizing the number of intra-cluster edges. While such algorithms find useful and interesting structures, they tend to fail to identify and isolate two kinds of vertices that play special roles - vertices that bridge clusters (hubs) and vertices that are marginally connected to clusters (outliers). Identifying hubs is useful for applications such as viral marketing and epidemiology since hubs are responsible for spreading ideas or disease. In contrast, outliers have little or no influence, and may be isolated as noise in the data. In this paper, we proposed a novel algorithm called SCAN (Structural Clustering Algorithm for Networks), which detects clusters, hubs and outliers in networks. It clusters vertices based on a structural similarity measure. The algorithm is fast and efficient, visiting each vertex only once. An empirical evaluation of the method using both synthetic and real datasets demonstrates superior performance over other methods such as the modularity-based algorithms.
引用
收藏
页码:824 / +
页数:3
相关论文
共 19 条
[1]   Internet -: Diameter of the World-Wide Web [J].
Albert, R ;
Jeong, H ;
Barabási, AL .
NATURE, 1999, 401 (6749) :130-131
[2]   Network biology:: Understanding the cell's functional organization [J].
Barabási, AL ;
Oltvai, ZN .
NATURE REVIEWS GENETICS, 2004, 5 (02) :101-U15
[3]  
Clauset A, 2004, PHYS REV E, V70, DOI 10.1103/PhysRevE.70.066111
[4]  
DING C, P ICDM 2001
[5]  
Domingos P, 2001, P 7 ACM SIGKDD INT C, P57, DOI DOI 10.1145/502512.502525
[6]  
Ester M., 1996, P 2 INT C KNOWL DISC, P226, DOI DOI 10.5555/3001460.3001507
[7]  
FALOUTSOS M, 1999, POWER LAW RELATIONSH
[8]   Functional cartography of complex metabolic networks [J].
Guimerà, R ;
Amaral, LAN .
NATURE, 2005, 433 (7028) :895-900
[9]   COMPARING PARTITIONS [J].
HUBERT, L ;
ARABIE, P .
JOURNAL OF CLASSIFICATION, 1985, 2 (2-3) :193-218
[10]  
Kleinberg J.M., 1999, LECT NOTES COMPUTER, V1627, P1, DOI DOI 10.1007/3-540-48686-0_1