Assessing the Suitability of Network Community Detection to Available Meta-Data using Rank Stability

被引:8
作者
Hartman, Ryan [1 ]
Faustino, Josemar [1 ]
Pinheiro, Diego [1 ]
Menezes, Ronaldo [1 ]
机构
[1] Florida Inst Technol, Sch Comp, BioComplex Lab, Melbourne, FL 32901 USA
来源
2017 IEEE/WIC/ACM INTERNATIONAL CONFERENCE ON WEB INTELLIGENCE (WI 2017) | 2017年
基金
美国国家科学基金会;
关键词
community detection; meta-data; rank stability; COMPLEX NETWORKS;
D O I
10.1145/3106426.3106493
中图分类号
TP18 [人工智能理论];
学科分类号
140502 [人工智能];
摘要
In the last two decades, we have witnessed the widespread use of structural analysis of data. The area, generally called Network Science, concentrates on understanding complex phenomena by looking for properties that emerge from the relationships between the pieces of data instead of the traditional mining of the data itself. A commonly used structural analysis in networks consists of finding subgraphs whose density of connections within the subgraph surpasses that of outside connections; called Community Detection. Many techniques have been proposed to find communities as well as benchmarks to evaluate the algorithms ability to find these substructures. Until recently, the literature has mostly neglected the fact that these communities often represent common characteristic of the elements in the community. For instance, in a social network, communities could represent: people who follow the same particular sport, people from the same classroom, authors working in the same field of study, to name a few. The problem here is one of community detection selection as a function of the ground truth provided by available meta-data. In this work, we propose the use of rank stability (entropy of ranks) to assess communities identified using different techniques from the perspective of meta-data. We validate our approach using a large-scale data set of on-line social interactions across multiple community detection techniques.
引用
收藏
页码:162 / 169
页数:8
相关论文
共 27 条
[1]
[Anonymous], 2010, P 19 INT C WORLD WID, DOI DOI 10.1145/1772690.1772755
[2]
[Anonymous], 2008, Proceeding of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, ACM
[3]
[Anonymous], 2017, LOYALTY ONLINE COMMU
[4]
[Anonymous], GROUND TRUTH METADAT
[5]
[Anonymous], DIRECTED LOUVAIN MAX
[6]
Condon A., 1999, Randomization, Approximation, and Combinatorial Optimization. Algorithms and Techniques. Third International Workshop on Radomization and Approximation Techniques in Computer Science, and Second International Workshop on Approximation Algorithms for Combinatorial Optimization Problems RANDOM-APPROX'99. Proceedings (Lecture Notes in Computer Science Vol.1671), P221
[7]
Comparing community structure identification -: art. no. P09008 [J].
Danon, L ;
Díaz-Guilera, A ;
Duch, J ;
Arenas, A .
JOURNAL OF STATISTICAL MECHANICS-THEORY AND EXPERIMENT, 2005, :219-228
[8]
Resolution limit in community detection [J].
Fortunato, Santo ;
Barthelemy, Marc .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2007, 104 (01) :36-41
[9]
Community detection in networks: A user guide [J].
Fortunato, Santo ;
Hric, Darko .
PHYSICS REPORTS-REVIEW SECTION OF PHYSICS LETTERS, 2016, 659 :1-44
[10]
Community detection in graphs [J].
Fortunato, Santo .
PHYSICS REPORTS-REVIEW SECTION OF PHYSICS LETTERS, 2010, 486 (3-5) :75-174