Robust linear model selection by cross-validation

被引:84
作者
Ronchetti, E [1 ]
Field, C [1 ]
Blanchard, W [1 ]
机构
[1] DALHOUSIE UNIV,DEPT MATH STAT & COMP SCI,HALIFAX,NS B3H 3J5,CANADA
关键词
bounded influence; construction sample; outliers; prediction error; robust prediction; validation sample;
D O I
10.2307/2965566
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
This article gives a robust technique for model selection in regression models, an important aspect of any data analysis involving regression. There is a danger that outliers will have an undue influence on the model chosen and distort any subsequent analysis. We provide a robust algorithm for model selection using Shao's cross-validation methods for choice of variables as a starting point. Because Shao's techniques are based on least squares, they are sensitive to outliers. We develop our robust procedure using the same ideas of cross-validation as Shao but using estimators that are optimal bounded influence for prediction. We demonstrate the effectiveness of our robust procedure in providing protection against outliers both in a simulation study and in a real example. We contrast the results with those obtained by Shao's method, demonstrating a substantial improvement in choosing the correct model in the presence of outliers with little loss of efficiency at the normal model.
引用
收藏
页码:1017 / 1023
页数:7
相关论文
共 11 条
[1]  
HAMPEL F. R., 1986, Robust Statistics: The Approach Based on Influence Functions
[2]   INFLUENCE CURVE AND ITS ROLE IN ROBUST ESTIMATION [J].
HAMPEL, FR .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 1974, 69 (346) :383-393
[3]  
Huber PJ., 1981, ROBUST STATISTICS
[4]  
KRASKER WS, 1982, J AM STAT ASSOC, V77, P59
[5]   A ROBUST VERSION OF MALLOWS C(P) [J].
RONCHETTI, E ;
STAUDTE, RG .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 1994, 89 (426) :550-559
[6]   ROBUST MODEL SELECTION IN REGRESSION [J].
RONCHETTI, E .
STATISTICS & PROBABILITY LETTERS, 1985, 3 (01) :21-23
[7]  
Ronchetti E., 1982, THESIS ETH ZURICH
[8]   LINEAR-MODEL SELECTION BY CROSS-VALIDATION [J].
SHAO, J .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 1993, 88 (422) :486-494
[9]  
STANISH HI, 1994, THESIS DALHOUSIE U
[10]   CROSS-VALIDATORY CHOICE AND ASSESSMENT OF STATISTICAL PREDICTIONS [J].
STONE, M .
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-STATISTICAL METHODOLOGY, 1974, 36 (02) :111-147