OCLUS: An analytic method for generating clusters with known overlap

被引:43
作者
Steinley, D
Henson, R
机构
[1] Univ Missouri, Dept Psychol Sci, Columbia, MO 65211 USA
[2] Univ N Carolina, Greensboro, NC 27412 USA
关键词
cluster generation; overlapping clusters;
D O I
10.1007/s00357-005-0015-6
中图分类号
O1 [数学];
学科分类号
0701 ; 070101 ;
摘要
The primary method for validating cluster analysis techniques is through Monte Carlo simulations that rely on generating data with known cluster structure (e.g., Milligan 1996). This paper defines two kinds of data generation mechanisms with cluster overlap, marginal and joint; current cluster generation methods are framed within these definitions. An algorithm generating overlapping clusters based on shared densities from several different multivariate distributions is proposed and shown to lead to an easily understandable notion of cluster overlap. Besides outlining the advantages of generating clusters within this framework, a discussion is given of how the proposed data generation technique can be used to augment research into current classification techniques such as finite mixture modeling, classification algorithm robustness, and latent profile analysis.
引用
收藏
页码:221 / 250
页数:30
相关论文
共 75 条
[1]  
Anderberg M.R., 1973, Probability and Mathematical Statistics
[2]  
[Anonymous], 1996, Clustering and Classification Ed. by, DOI DOI 10.1142/1930
[3]  
[Anonymous], 1986, PATTERN RECOGN
[4]  
[Anonymous], 1987, ROBUST REGRESSION OU
[5]   COMPARATIVE-EVALUATION OF 2 SUPERIOR STOPPING RULES FOR HIERARCHICAL CLUSTER-ANALYSIS [J].
ATLAS, RS ;
OVERALL, JE .
PSYCHOMETRIKA, 1994, 59 (04) :581-591
[6]   MEASURING POWER OF HIERARCHICAL CLUSTER-ANALYSIS [J].
BAKER, FB ;
HUBERT, LJ .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 1975, 70 (349) :31-38
[7]  
BALAKRISHNAN PV, 1994, PSYCHOMETRIKA, V59, P509
[8]   A CLUSTERING TECHNIQUE FOR SUMMARIZING MULTIVARIATE DATA [J].
BALL, GH ;
HALL, DJ .
BEHAVIORAL SCIENCE, 1967, 12 (02) :153-&
[9]   MODEL-BASED GAUSSIAN AND NON-GAUSSIAN CLUSTERING [J].
BANFIELD, JD ;
RAFTERY, AE .
BIOMETRICS, 1993, 49 (03) :803-821
[10]   MONTE-CARLO COMPARISONS OF SELECTED CLUSTERING PROCEDURES [J].
BAYNE, CK ;
BEAUCHAMP, JJ ;
BEGOVICH, CL ;
KANE, VE .
PATTERN RECOGNITION, 1980, 12 (02) :51-62