Resampling method for unsupervised estimation of cluster validity

被引:171
作者
Levine, E [1 ]
Domany, E [1 ]
机构
[1] Weizmann Inst Sci, Dept Phys Complex Syst, IL-76100 Rehovot, Israel
关键词
D O I
10.1162/089976601753196030
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We introduce a method for validation of results obtained by clustering analysis of data. The method is based on resampling the available data. A figure of merit that measures the stability of clustering solutions against resampling is introduced. Clusters that are stable against resampling give rise to local maxima of this figure of merit. This is presented first for a one-dimensional data set, for which an analytic approximation for the figure of merit is derived and compared with numerical measurements. Next, the applicability of the method is demonstrated for higher-dimensional data, including gene microarray expression data.
引用
收藏
页码:2573 / 2593
页数:21
相关论文
共 25 条
[1]   Broad patterns of gene expression revealed by clustering analysis of tumor and normal colon tissues probed by oligonucleotide arrays [J].
Alon, U ;
Barkai, N ;
Notterman, DA ;
Gish, K ;
Ybarra, S ;
Mack, D ;
Levine, AJ .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 1999, 96 (12) :6745-6750
[2]  
[Anonymous], Pattern Recognition With Fuzzy Objective Function Algorithms
[3]  
Bezdek J. C., 1995, Proceedings. 1995. Second New Zealand International Two-Stream Conference on Artificial Neural Networks and Expert Systems, P190, DOI 10.1109/ANNES.1995.499469
[4]   Superparamagnetic clustering of data [J].
Blatt, M ;
Wiseman, S ;
Domany, E .
PHYSICAL REVIEW LETTERS, 1996, 76 (18) :3251-3254
[5]   ON SOME SIGNIFICANCE TESTS IN CLUSTER-ANALYSIS [J].
BOCK, HH .
JOURNAL OF CLASSIFICATION, 1985, 2 (01) :77-108
[6]   Exploring the new world of the genome with DNA microarrays [J].
Brown, PO ;
Botstein, D .
NATURE GENETICS, 1999, 21 (Suppl 1) :33-37
[7]   Accessing genetic information with high-density DNA arrays [J].
Chee, M ;
Yang, R ;
Hubbell, E ;
Berno, A ;
Huang, XC ;
Stern, D ;
Winkler, J ;
Lockhart, DJ ;
Morris, MS ;
Fodor, SPA .
SCIENCE, 1996, 274 (5287) :610-614
[8]  
Cover T. M., 2005, ELEM INF THEORY, DOI 10.1002/047174882X
[9]  
Cutler A., 1994, P FRST USJAPAN C FRO, P149
[10]  
DAVIES DL, 1979, IEEE T PATTERN ANAL, P224