Testing of clustering

被引:13
作者
Alon, N [1 ]
Dar, S [1 ]
Parnas, M [1 ]
Ron, D [1 ]
机构
[1] Tel Aviv Univ, Dept Math, Ramat Aviv, Israel
来源
41ST ANNUAL SYMPOSIUM ON FOUNDATIONS OF COMPUTER SCIENCE, PROCEEDINGS | 2000年
关键词
D O I
10.1109/SFCS.2000.892111
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
A set X of points in R-d is (k, b)-clusterable if X can be partitioned into k subsets (clusters) so that the diameter (alternatively, the radius) of each cluster is at most b. We present algorithms that by sampling from a set X, distinguish between the case that X is (k, b)-clusterable and the case that X is epsilon -far from being (k, b')-clusterable for any given 0 < <epsilon> less than or equal to 1 and for b' greater than or equal to b. In epsilon -far from being (k, b')-clusterable we mean that more than epsilon. \X\ points should be removed from X so that it becomes (Ic, b')-clusterable. We give algorithms for a variety of cost measures that use a sample of size independent of \X\, and polynomial in Ic and 1/epsilon. Our algorithms can also be used to find approximately good clusterings. Namely, these are clusterings of all but an e-fraction of the points in X that have optimal (or close to optimal) cost. The benefit of our algorithms is that they construct an implicit representation of such clusterings in time independent of \X\. That is, without actually having to partition all points in X, the implicit representation can be used to answer queries concerning the cluster any given point belongs to.
引用
收藏
页码:240 / 250
页数:11
相关论文
共 41 条
[1]  
AGRAWAL PK, 1998, P SODA, P658
[2]  
AGRAWAL PK, 1998, ACM COMPUT SURV, P413
[3]  
ALON N, 1999, P 40 ANN S FDN COMP, P645
[4]  
ALON N, 1999, P 40 ANN S FDN COMP, P656
[5]  
ALON N, 2000, TESTING CLUSTERING
[6]  
ALON N, 1999, UNPUB TESTING K COLO
[7]  
Anderberg M.R., 1973, Probability and Mathematical Statistics
[8]  
BENDER M, 2000, P ICALP, P809
[9]   SELF-TESTING CORRECTING WITH APPLICATIONS TO NUMERICAL PROBLEMS [J].
BLUM, M ;
LUBY, M ;
RUBINFELD, R .
JOURNAL OF COMPUTER AND SYSTEM SCIENCES, 1993, 47 (03) :549-595
[10]  
Dodis Yevgeniy, 1999, P 3 INT WORKSH RAND, P97