A GENERALIZED EXTREME STUDENTIZED RESIDUAL MULTIPLE-OUTLIER-DETECTION PROCEDURE IN LINEAR-REGRESSION

被引:46
作者
PAUL, SR
FUNG, KY
机构
关键词
MAXIMUM ABSOLUTE STUDENTIZED RESIDUAL; 2-PHASE PROCEDURE;
D O I
10.2307/1268785
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
This article is concerned with procedures for detecting multiple y outliers in linear regression. A generalized extreme studentized residual (GESR) procedure, which controls type I error rate, is developed. An approximate formula to calculate the percentiles is given for large samples and more accurate percentiles for n less-than-or-equal-to 25 are tabulated. The performance of this procedure is compared with others by Monte Carlo techniques and found to be superior. The procedure, however, fails in detecting y outliers that are on high-leverage cases. For this, a two-phase procedure is suggested. In phase 1, a set of suspect observations is identified by GESR and one of the diagnostics applied sequentially. In phase 2, a backward testing is conducted using the GESR procedure to see which of the suspect cases are outliers. Several examples are analyzed.
引用
收藏
页码:339 / 348
页数:10
相关论文
共 20 条
[11]   TESTING FOR A SINGLE OUTLIER IN A LINEAR-MODEL [J].
DOORNBOS, R .
BIOMETRICS, 1981, 37 (04) :705-711
[12]   DETECTING OUTLIERS .2. SUPPLEMENTING DIRECT ANALYSIS OF RESIDUALS [J].
GENTLEMAN, JF ;
WILK, MB .
BIOMETRICS, 1975, 31 (02) :387-410
[13]  
Hawkins D.M., 1980, LONDON CHAPMAN HALL
[14]   HAT MATRIX IN REGRESSION AND ANOVA [J].
HOAGLIN, DC ;
WELSCH, RE .
AMERICAN STATISTICIAN, 1978, 32 (01) :17-22
[15]   A MULTISTAGE PROCEDURE FOR DETECTING SEVERAL OUTLIERS IN LINEAR-REGRESSION [J].
MARASINGHE, MG .
TECHNOMETRICS, 1985, 27 (04) :395-399
[16]  
Mickey M R, 1967, Comput Biomed Res, V1, P105, DOI 10.1016/0010-4809(67)90009-2
[17]  
ROSNER B, 1983, TECHNOMETRRICS, V26, P165
[18]  
Rousseeuw P., 2003, ROBUST REGRESSION OU
[19]  
ROUSSEEUW PJ, 1990, J AM STAT ASSOC, V85, P633, DOI 10.2307/2289995
[20]  
1982, IMSL LIBRARY, V3