Giving meaningful interpretation to ordination axes: Assessing loading significance in principal component analysis

被引:308
作者
Peres-Neto, PR [1 ]
Jackson, DA [1 ]
Somers, KM [1 ]
机构
[1] Univ Toronto, Dept Zool, Toronto, ON M5S 3G5, Canada
关键词
bootstrap; eigenvector coefficients; multivariate analysis; numerical ecology; power; randomization;
D O I
10.1890/00-0634
中图分类号
Q14 [生态学(生物生态学)];
学科分类号
071012 ; 0713 ;
摘要
Principal component analysis (PCA) is one of the most commonly used tools in the analysis of ecological data. This method reduces the effective dimensionality of a multivariate data set by producing linear combinations of the original variables (i.e., components) that summarize the predominant patterns in the data. In order to provide meaningful interpretations for principal components, it is important to determine which variables are associated with particular components. Some data analysts incorrectly test the statistical significance of the correlation between original variables and multivariate scores using standard statistical tables. Others interpret eigenvector coefficients larger than an arbitrary absolute value (e.g., 0.50). Resampling, randomization techniques, and parallel analysis have been applied in a few cases. In this study, we compared the performance of a variety of approaches for assessing the significance of eigenvector coefficients in terms of type I error rates and power. Two novel approaches based on the broken-stick model were also evaluated. We used a variety of simulated scenarios to examine the influence of the number of real dimensions in the data; unique versus complex variables the magnitude of eigenvector coefficients; and the number of variables associated with a particular dimension. Our results revealed that bootstrap confidence intervals and a modified bootstrap confidence interval for the broken-stick model proved to be the most reliable techniques.
引用
收藏
页码:2347 / 2363
页数:17
相关论文
共 49 条
[1]   An empirical comparison of permutation methods for tests of partial regression coefficients in a linear model [J].
Anderson, MJ ;
Legendre, P .
JOURNAL OF STATISTICAL COMPUTATION AND SIMULATION, 1999, 62 (03) :271-303
[2]  
Anderson T., 1984, INTRO MULTIVARIATE S
[3]  
[Anonymous], 1979, Morphometrics, the multivariate analysis of biological data
[4]  
[Anonymous], 1995, Randomization tests
[5]   Species composition and morphological structure of the bat fauna of Yucatan, Mexico [J].
Arita, HT .
JOURNAL OF ANIMAL ECOLOGY, 1997, 66 (01) :83-97
[6]   REMARKS ON PARALLEL ANALYSIS [J].
BUJA, A ;
EYUBOGLU, N .
MULTIVARIATE BEHAVIORAL RESEARCH, 1992, 27 (04) :509-540
[7]   LOADINGS AND CORRELATIONS IN THE INTERPRETATION OF PRINCIPAL COMPONENTS [J].
CADIMA, J ;
JOLLIFFE, IT .
JOURNAL OF APPLIED STATISTICS, 1995, 22 (02) :203-214
[8]  
CHATFIELD C, 1980, INTRO MULIVARIATE AN
[9]   STUDY OF SAMPLING ERRORS IN FACTOR ANALYSIS BY MEANS OF ARTIFICIAL EXPERIMENTS [J].
CLIFF, N ;
HAMBURGER, CD .
PSYCHOLOGICAL BULLETIN, 1967, 68 (06) :430-+
[10]  
Diniz JAF, 1998, EVOLUTION, V52, P1247, DOI 10.1111/j.1558-5646.1998.tb02006.x