ON THE PREDICTIVE PERFORMANCE OF BIASED REGRESSION METHODS AND MULTIPLE LINEAR-REGRESSION

被引:22
作者
KOWALSKI, KG
机构
[1] G.D. Searle and Co. Preclinical Statistics Dept., Skokie, IL 60077
关键词
D O I
10.1016/0169-7439(90)80096-O
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Kowalski, K.G., 1990. On the predictive performance of biased regression methods and multiple linear regression. Chemometrics and Intelligent Laboratory Systems, 9: 177-184. The predictive performance of three commonly used biased regression methods (ridge, principal components and partial least squares) and multiple linear regression (ordinary least squares) using classical model selection techniques are evaluated on five data sets published in the chemical and statistical literature. For these five data sets, the degree of collinearity among the regressors varies considerably. For each data set, cross-validation is performed and the prediction error sum of squares (PRESS) is computed to assess the predictive performance of each method. The results show that multiple linear regression using reduced models obtained by classical methods of model selection performed better (lower PRESS) than the three commonly used biased regression methods. © 1990.
引用
收藏
页码:177 / 184
页数:8
相关论文
共 21 条
[1]  
Hoerl, Kennard, Ridge regression Applications to nonorthogonal problems, Technometrics, 12, pp. 69-82, (1970)
[2]  
Draper, Smith, Applied Regression Analysis, (1981)
[3]  
Geladi, Notes on the history of partial least squares (PLS) modeling, Journal of Chemometrics, 2, pp. 231-246, (1988)
[4]  
Sjostrom, Wold, Lindberg, Persson, Martens, A multivariate calibration problem in analytical chemistry solved by partial least-squares in latent variables, Analytica Chimica Acta, 150, pp. 61-70, (1983)
[5]  
Dunn, Wold, Edlund, Hellberg, Gasteiger, Multivariate structure—activity relationships between data from a battery of biological tests and an ensemble of structure descriptors: The PLS method, Quantitative Structure-Activity Relationships, 3, pp. 131-137, (1984)
[6]  
Montgomery, Peck, Introduction to Linear Regression Analysis, (1982)
[7]  
Allen, The Prediction Sum of Squares as a Criterion for Selecting Variables, (1971)
[8]  
Allen, The relationship between variable selection and data augmentation and a method for prediction, Technometrics, 16, pp. 125-127, (1974)
[9]  
Geisser, The predictive sample reuse method with applications, Journal of the American Statistical Association, 70, pp. 320-328, (1975)
[10]  
Golub, Heath, Wahba, Generalized cross validation as a method for choosing a good ridge parameter, Technometrics, 21, pp. 215-224, (1979)