Fast agglomerative clustering using a k-nearest neighbor graph

被引:251
作者
Franti, Pasi [1 ]
Virmajoki, Olli [1 ]
Hautamaki, Ville [1 ]
机构
[1] Univ Joensuu, Dept Comp Sci, Speech & Image Proc Unit, FIN-80101 Joensuu, Finland
关键词
clustering; agglomeration; nearest neighbor; vector quantization; PNN;
D O I
10.1109/TPAMI.2006.227
中图分类号
TP18 [人工智能理论];
学科分类号
081104 [模式识别与智能系统]; 0812 [计算机科学与技术]; 0835 [软件工程]; 1405 [智能科学与技术];
摘要
We propose a fast agglomerative clustering method using an approximate nearest neighbor graph for reducing the number of distance calculations. The time complexity of the algorithm is improved from Od(tau N-2) to Od (tau N logN) at the cost of a slight increase in distortion; here, tau denotes the number of nearest neighbor updates required at each iteration. According to the experiments, a relatively small neighborhood size is sufficient to maintain the quality close to that of the full search.
引用
收藏
页码:1875 / 1881
页数:7
相关论文
共 25 条
[1]
An automatic shape independent clustering technique [J].
Bandyopadhyay, S .
PATTERN RECOGNITION, 2004, 37 (01) :33-45
[2]
Conway J. H., 1998, SPHERE PACKINGS LATT
[3]
CORMEN T, 1998, INTRO ALGORITHMS
[4]
A NEW VECTOR QUANTIZATION CLUSTERING-ALGORITHM [J].
EQUITZ, WH .
IEEE TRANSACTIONS ON ACOUSTICS SPEECH AND SIGNAL PROCESSING, 1989, 37 (10) :1568-1575
[5]
Fast and memory efficient implementation of the exact PNN [J].
Fränti, P ;
Kaukoranta, T ;
Shen, DF ;
Chang, KS .
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2000, 9 (05) :773-777
[6]
Fast PNN-based clustering using K-nearest neighbor graph [J].
Fränti, P ;
Virmajoki, O ;
Hautamäki, V .
THIRD IEEE INTERNATIONAL CONFERENCE ON DATA MINING, PROCEEDINGS, 2003, :525-528
[7]
Friedman J. H., 1977, ACM Transactions on Mathematical Software, V3, P209, DOI 10.1145/355744.355745
[8]
GOVER JC, 1969, APPL STAT, V18, P54
[9]
HAREL D, 2001, P 7 ACM SIGKDD INT C, P281
[10]
Chameleon: Hierarchical clustering using dynamic modeling [J].
Karypis, G ;
Han, EH ;
Kumar, V .
COMPUTER, 1999, 32 (08) :68-+