A PRELIMINARY STUDY OF OPTIMAL VARIABLE WEIGHTING IN K-MEANS CLUSTERING

Cited by: 58
Authors
GREEN, PE [1]
CARMONE, FJ [1]
KIM, J [1]
Affiliations
[1] DREXEL UNIV, DEPT MKT, PHILADELPHIA, PA 19104
Keywords
Cross validation; k-means clustering; optimal weighting; Rand index
DOI
10.1007/BF01908720
Chinese Library Classification number
O1 [Mathematics]
Discipline classification code
0701; 070101
Abstract
Recently, algorithms for optimally weighting variables in non-hierarchical and hierarchical clustering methods have been proposed. Preliminary Monte Carlo research has shown that at least one of these algorithms cross-validates extremely well. The present study applies a k-means, optimal weighting procedure to two empirical data sets and contrasts its cross-validation performance with that of unit (i.e., equal) weighting of the variables. We find that the optimal weighting procedure cross-validates better in one of the two data sets. In the second data set its comparative performance strongly depends on the approach used to find seed values for the initial k-means partitioning. © 1990 Springer-Verlag New York Inc.
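The abstract names the Rand index and contrasts optimal with unit weighting but gives no formulas. As a rough illustration only, assuming the standard (unadjusted) Rand index and a per-variable weighted squared Euclidean distance, the two ingredients might be sketched as follows. The function names `rand_index` and `weighted_sq_distance` are hypothetical and are not taken from the paper, and this is not the authors' weighting procedure.

```python
from itertools import combinations


def rand_index(labels_a, labels_b):
    """Unadjusted Rand index: share of object pairs on which two partitions agree.

    A pair "agrees" when both partitions put the two objects together,
    or both put them apart. Assumed form; the paper may use a variant.
    """
    if len(labels_a) != len(labels_b):
        raise ValueError("partitions must label the same objects")
    pairs = list(combinations(range(len(labels_a)), 2))
    if not pairs:
        raise ValueError("need at least two objects")
    agreements = sum(
        (labels_a[i] == labels_a[j]) == (labels_b[i] == labels_b[j])
        for i, j in pairs
    )
    return agreements / len(pairs)


def weighted_sq_distance(x, y, weights):
    """Squared Euclidean distance with one non-negative weight per variable.

    Unit weighting corresponds to weights of all ones.
    """
    return sum(w * (a - b) ** 2 for a, b, w in zip(x, y, weights))


if __name__ == "__main__":
    # Same grouping under different label names counts as full agreement.
    print(rand_index([0, 0, 1, 1], [1, 1, 0, 0]))
    # Unit weights versus a weighting that ignores the second variable.
    print(weighted_sq_distance([0, 0], [1, 2], [1, 1]))
    print(weighted_sq_distance([0, 0], [1, 2], [1, 0]))
```

In a cross-validation setting, one would typically compare the partition found on one half of the data with the partition the other half's solution assigns to the same objects, and score the match with `rand_index`.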
Pages: 271-285
Number of pages: 15