PARALLEL ANALYSIS - A METHOD FOR DETERMINING SIGNIFICANT PRINCIPAL COMPONENTS

Cited by: 278
Authors
FRANKLIN, SB
GIBSON, DJ
ROBERTSON, PA
POHLMANN, JT
FRALISH, JS
Affiliations
[1] Department of Plant Biology, Southern Illinois University, Carbondale, Illinois
[2] Education Psychology and Special Education Department, Southern Illinois University, Carbondale, Illinois
[3] Department of Forestry, Southern Illinois University, Carbondale, Illinois
Keywords
LITERATURE RESEARCH; OVEREXTRACTION; PRINCIPAL COMPONENTS ANALYSIS; SPURIOUS COMPONENT
DOI
10.2307/3236261
Chinese Library Classification number
Q94 [Botany]
Discipline classification code
071001
Abstract
Numerous ecological studies use Principal Components Analysis (PCA) for exploratory analysis and data reduction. Determination of the number of components to retain is the most crucial problem confronting the researcher when using PCA. An incorrect choice may lead to the underextraction of components, but commonly results in overextraction. Of several methods proposed to determine the significance of principal components, Parallel Analysis (PA) has proven consistently accurate in determining the threshold for significant components, variable loadings, and analytical statistics when decomposing a correlation matrix. In this procedure, eigenvalues from a data set prior to rotation are compared with those from a matrix of random values of the same dimensionality (p variables and n samples). PCA eigenvalues from the data greater than PA eigenvalues from the corresponding random data can be retained. All components with eigenvalues below this threshold value should be considered spurious. We illustrate Parallel Analysis on an environmental data set. We reviewed all articles utilizing PCA or Factor Analysis (FA) published from 1987 to 1993 in Ecology, Ecological Monographs, Journal of Vegetation Science and Journal of Ecology. Analyses were first separated into PCAs that decomposed a correlation matrix and PCAs that decomposed a covariance matrix. PA was applied to each PCA/FA found in the literature. Of 39 analyses (in 22 articles), 29 (74.4%) considered no threshold rule, presumably retaining interpretable components. According to the PA results, 26 (66.7%) overextracted components. This overextraction may have resulted in potentially misleading interpretation of spurious components. It is suggested that the routine use of PA in multivariate ordination will increase confidence in the results and reduce the subjective interpretation of supposedly objective methods.
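The comparison described in the abstract (eigenvalues of the data's correlation matrix against eigenvalues from random data with the same n samples and p variables) can be sketched in Python with NumPy. This is a minimal illustration, not the authors' implementation: the function name `parallel_analysis`, the use of standard normal random data, the number of simulated matrices (`n_iter`), and the 95th-percentile threshold are all assumptions chosen for the example. The abstract only specifies a random matrix of the same dimensionality.

```python
import numpy as np


def parallel_analysis(data, n_iter=200, percentile=95, seed=0):
    """Sketch of Parallel Analysis on a correlation matrix.

    data: array of shape (n samples, p variables).
    n_iter, percentile, seed: hypothetical parameters for this example;
    the source abstract does not specify them.
    Returns (observed eigenvalues, random-data thresholds, components to retain).
    """
    rng = np.random.default_rng(seed)
    n, p = data.shape

    # Eigenvalues of the observed correlation matrix, largest first.
    observed = np.linalg.eigvalsh(np.corrcoef(data, rowvar=False))[::-1]

    # Eigenvalues from random matrices with the same n and p.
    random_eigs = np.empty((n_iter, p))
    for i in range(n_iter):
        noise = rng.standard_normal((n, p))  # assumption: normal random values
        random_eigs[i] = np.linalg.eigvalsh(np.corrcoef(noise, rowvar=False))[::-1]

    # Assumption: threshold is a percentile over the simulated eigenvalues.
    threshold = np.percentile(random_eigs, percentile, axis=0)

    # Retain leading components until one falls at or below its threshold.
    exceeds = observed > threshold
    n_retain = p if exceeds.all() else int(np.argmin(exceeds))
    return observed, threshold, n_retain
```

Calling `parallel_analysis(X)` on an n-by-p array `X` returns the observed eigenvalues, the per-component thresholds, and the number of leading components whose eigenvalues exceed the random-data thresholds. Components beyond that count would be treated as spurious under the rule stated in the abstract.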
Pages: 99-106
Number of pages: 8