GENERALIZED COLLINEARITY DIAGNOSTICS

被引:1226
作者
FOX, J
MONETTE, G
机构
[1] YORK UNIV,MATH & STAT,N YORK M3J 1P3,ONTARIO,CANADA
[2] YORK UNIV,INST SOCIAL RES,STAT CONSULTING SERV,N YORK M3J 1P3,ONTARIO,CANADA
关键词
CANONICAL CORRELATION; JOINT CONFIDENCE REGIONS; SPECTRAL DECOMPOSITION; VARIANCE INFLATION;
D O I
10.2307/2290467
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
Working in the context of the linear model y = X-beta + epsilon, we generalize the concept of variance inflation as a measure of collinearity to a subset of parameters in beta (denoted by beta-1 with the associated columns of X given by X1). The essential idea underlying this generalization is to examine the impact on the precision of estimation-in particular, the size of an ellipsoidal joint confidence region for beta-1-of less-than-optimal selection of other columns of the design matrix (X2), treating still other columns (X0) as unalterable, even hypothetically. In typical applications, X1 contains a set of dummy regressors coding categories of a qualitative variable or a set of polynomial regressors in a quantitative variable; X2 contains all other regressors in the model, save the constant, which is in X0. If sigma-2V denotes the realized variance of beta-1 and sigma-2U is the variance associated with an optimal selection of X2, then the corresponding scaled dispersion ellipsoids to be compared are E(V), = {x : x'V-1x less-than-or-equal-to 1} and E(U) = {x : x'U-1x less-than-or-equal-to 1}, where E(U) is contained in E(V). The two ellipsoids can be compared by considering the radii of E(V) relative to E(U), obtained through the spectral decomposition of V relative to U. We proceed to explore the geometry of generalized variance inflation, to show the relationship of these measures to correlation-matrix determinants and canonical correlations, to consider X matrices structured by relations of marginality among regressor subspaces, to develop the relationship of generalized variance inflation to hypothesis tests in the multivariate normal linear model, and to present several examples.
引用
收藏
页码:178 / 183
页数:6
相关论文
共 19 条
[1]  
ANDREWS DF, 1978, J ROY STAT SOC B MET, V40, P85
[2]  
Belshey D. A., 1980, REGRESSION DIAGNOSTI
[3]   DEMEANING CONDITIONING DIAGNOSTICS THROUGH CENTERING - COMMENT [J].
COOK, RD .
AMERICAN STATISTICIAN, 1984, 38 (02) :78-79
[4]  
EATON ML, 1983, MULTIVARIATE STATISI
[5]   PREDICTING RESPONSE RATES TO MAILED QUESTIONNAIRES - COMMENT [J].
EICHNER, K ;
HABERMEHL, W .
AMERICAN SOCIOLOGICAL REVIEW, 1981, 46 (03) :361-363
[6]  
FOX J, 1984, LINEAR STATISTICAL M
[7]   NONRESPONSE ON SURVEYS - A CANADA-UNITED-STATES COMPARISON [J].
GOYDER, J .
CANADIAN JOURNAL OF SOCIOLOGY-CAHIERS CANADIENS DE SOCIOLOGIE, 1985, 10 (03) :231-251
[9]  
GRAYBILL FA, 1976, THEORY APPLICATION L
[10]   FACTORS AFFECTING RESPONSE RATES TO MAILED QUESTIONNAIRES - QUANTITATIVE-ANALYSIS OF PUBLISHED LITERATURE [J].
HEBERLEIN, TA ;
BAUMGARTNER, R .
AMERICAN SOCIOLOGICAL REVIEW, 1978, 43 (04) :447-462