Comparison of single and complete linkage clustering with the hierarchical factor classification of variables

被引:9
作者
Camiz, S.
Pillar, V. D.
机构
[1] Univ Roma La Sapienza, Dipartimento Matemat Guido Castelnuovo, I-00185 Rome, Italy
[2] Univ Fed Rio Grande do Sul, Dept Ecol, BR-91540 Porto Alegre, RS, Brazil
关键词
classification of variables; comparison of methods; hierarchical classification; principal components analysis; randomization tests; simulated correlation matrices;
D O I
10.1556/ComEc.8.2007.1.4
中图分类号
Q14 [生态学(生物生态学)];
学科分类号
071012 ; 0713 ;
摘要
We assess the performance of a new clustering method for Hierarchical Factor Classification of variables, which is based on the evaluation of the least differences among representative variables of groups, as defined by a set of two-dimensional Principal Components Analysis. As an additional feature the method gives at each step a principal plane where both grouped variables and units, as seen only by these variables, can be projected. We compare the method results with both single and complete linkage clustering, applied to simulated data with known correlation structure and we evaluate the results with a coherence measure based on the entropy between the expected partitions and those found by the methods. We found that the Hierarchical Factor Classification method performed as good as, and in some cases better than, both single and complete linkage clustering in detecting the known group structures in simulated data, with the advantage that the groups of variables and the units can be viewed on principal planes where usual interpretations apply.
引用
收藏
页码:25 / 30
页数:6
相关论文
共 21 条
[1]  
Anderberg M.R., 1973, Probability and Mathematical Statistics
[2]  
[Anonymous], J STAT COMPUT SIMUL
[3]   Hierarchical factor classification of variables in ecology [J].
Camiz, S. ;
Denima, J. -J. ;
Pillar, V. D. .
COMMUNITY ECOLOGY, 2006, 7 (02) :165-179
[4]  
Denimal J.J., 2001, P 10 INT S APPL STOC
[5]  
Florek K., 1951, COLLOQ MATH-WARSAW, V2, P282, DOI DOI 10.4064/CM-2-3-4-282-285
[6]  
Gordon A, 1999, Classification
[7]   A GENERAL THEORY OF CLASSIFICATORY SORTING STRATEGIES .1. HIERARCHICAL SYSTEMS [J].
LANCE, GN ;
WILLIAMS, WT .
COMPUTER JOURNAL, 1967, 9 (04) :373-&
[8]  
LEGENDRE L., 1983, NUMERICAL ECOLOGY, DOI DOI 10.1017/CBO9781107415324.004
[9]  
Lerman I.C., 1991, APPL STOCH MODEL BUS, V7, P63
[10]   AN EXAMINATION OF PROCEDURES FOR DETERMINING THE NUMBER OF CLUSTERS IN A DATA SET [J].
MILLIGAN, GW ;
COOPER, MC .
PSYCHOMETRIKA, 1985, 50 (02) :159-179