Determining the number of clusters by sampling with replacement

被引:21
作者
Tonidandel, S
Overall, JE
机构
[1] Davidson Coll, Dept Psychol, Davidson, NC 28035 USA
[2] Univ Texas, Hlth Sci Ctr, Dept Psychiat, Houston, TX USA
关键词
D O I
10.1037/1082-989X.9.2.238
中图分类号
B84 [心理学];
学科分类号
04 ; 0402 ;
摘要
A split-sample replication criterion originally proposed by J. E. Overall and K. N. Magee (1992) as a stopping rule for hierarchical cluster analysis is applied to multiple data sets generated by sampling with replacement from an original simulated primary data set. An investigation of the validity of this bootstrap procedure was undertaken using different combinations of the true number of latent populations, degrees of overlap, and sample sizes. The bootstrap procedure enhanced the accuracy of identifying the true number of latent populations under virtually all conditions. Increasing the size of the resampled data sets relative to the size of the primary data set further increased accuracy. A computer program to implement the bootstrap stopping rule is made available via a referenced Web site.
引用
收藏
页码:238 / 249
页数:12
相关论文
共 28 条
[1]  
[Anonymous], 1971, MATH ARCHAEOLOGICAL
[2]  
[Anonymous], 1939, CLUSTER ANAL CORRELA
[3]  
[Anonymous], 1969, CLUSTERING AGGREGATI
[4]   COMPARATIVE-EVALUATION OF 2 SUPERIOR STOPPING RULES FOR HIERARCHICAL CLUSTER-ANALYSIS [J].
ATLAS, RS ;
OVERALL, JE .
PSYCHOMETRIKA, 1994, 59 (04) :581-591
[5]  
Bradley L A, 1978, J Behav Med, V1, P253, DOI 10.1007/BF00846678
[6]   REPLICATING CLUSTER-ANALYSIS - METHOD, CONSISTENCY, AND VALIDITY [J].
BRECKENRIDGE, JN .
MULTIVARIATE BEHAVIORAL RESEARCH, 1989, 24 (02) :147-161
[7]  
Calinski T., 1974, COMMUN STAT, V3, P1, DOI [10.1080/03610927408827101, DOI 10.1080/03610927408827101]
[8]  
Chernick MR., 1999, Bootstrap methods
[9]  
a practitioner's guide
[10]  
Efron B., 1982, SOC IND APPL MATH CB, V38, DOI [10.1137/1.9781611970319, DOI 10.1137/1.9781611970319]