IDENTIFYING CLUSTER OVERLAP WITH NORMIX POPULATION MEMBERSHIP PROBABILITIES

被引:7
作者
PRICE, LJ
机构
[1] Department of Marketing, INSEAD, 77305 Fontainebleau Cedex, Boulevard de Constance
关键词
D O I
10.1207/s15327906mbr2802_5
中图分类号
O1 [数学];
学科分类号
0701 ; 070101 ;
摘要
Prior research has suggested that the NORMIX clustering procedure performs poorly under conditions of cluster overlap. In each previous study, however, the criterion of clustering accuracy specified that an entity be assigned to the single population from which it was sampled. NORMIX population membership probabilities were used for this purpose by assigning entities to the single population of highest probability. It is argued in this research that, when populations overlap, a more appropriate criterion of accuracy would recognize that entities which fall in the overlapping regions of populations either belong to multiple populations or they belong to a single population whose identity is indeterminate. It is further argued that NORMIX probabilities can be useful in identifying the entities from such overlapping regions. The results of a Monte Carlo test support this argument.
引用
收藏
页码:235 / 262
页数:28
相关论文
共 22 条
[1]   MONTE-CARLO COMPARISONS OF SELECTED CLUSTERING PROCEDURES [J].
BAYNE, CK ;
BEAUCHAMP, JJ ;
BEGOVICH, CL ;
KANE, VE .
PATTERN RECOGNITION, 1980, 12 (02) :51-62
[2]  
CATTELL RB, 1949, PSYCHOMETRIKA, V14, P279
[3]   OMEGA - A GENERAL FORMULATION OF THE RAND INDEX OF CLUSTER RECOVERY SUITABLE FOR NON-DISJOINT SOLUTIONS [J].
COLLINS, LM ;
DENT, CW .
MULTIVARIATE BEHAVIORAL RESEARCH, 1988, 23 (02) :231-242
[4]  
CONNOR WS, 1961, AD700470
[6]  
Everitt B., 1974, CLUSTER ANAL
[8]  
Hartigan JA., 1975, CLUSTERING ALGORITHM
[9]   MONTE-CARLO COMPARISON OF 6 CLUSTERING PROCEDURES [J].
KUIPER, FK ;
FISHER, L .
BIOMETRICS, 1975, 31 (03) :777-783
[10]  
MacQueen J., 1967, 5 BERKELEY S MATH ST, V1, P281