A GENERALIZED EXTREME STUDENTIZED RESIDUAL MULTIPLE-OUTLIER-DETECTION PROCEDURE IN LINEAR-REGRESSION

被引:46
作者
PAUL, SR
FUNG, KY
机构
关键词
MAXIMUM ABSOLUTE STUDENTIZED RESIDUAL; 2-PHASE PROCEDURE;
D O I
10.2307/1268785
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
This article is concerned with procedures for detecting multiple y outliers in linear regression. A generalized extreme studentized residual (GESR) procedure, which controls type I error rate, is developed. An approximate formula to calculate the percentiles is given for large samples and more accurate percentiles for n less-than-or-equal-to 25 are tabulated. The performance of this procedure is compared with others by Monte Carlo techniques and found to be superior. The procedure, however, fails in detecting y outliers that are on high-leverage cases. For this, a two-phase procedure is suggested. In phase 1, a set of suspect observations is identified by GESR and one of the diagnostics applied sequentially. In phase 2, a backward testing is conducted using the GESR procedure to see which of the suspect cases are outliers. Several examples are analyzed.
引用
收藏
页码:339 / 348
页数:10
相关论文
共 20 条
[1]  
ANDREWS DF, 1978, J ROY STAT SOC B MET, V40, P85
[2]  
ATKINSON AC, 1985, PLOTS TRANSFORMATION
[3]  
BARNETT V, 1984, OUTLIERS STATISTICAL
[4]  
Belsley D., 1980, REGRESSION DIAGNOSTI
[5]  
Chatterjee S., 1988, SENSITIVITY ANAL LIN, DOI 10.1002/9780470316764
[6]  
Cook R Dennis, 1982, RESIDUALS INFLUENCE
[7]   ON THE ACCURACY OF BONFERRONI SIGNIFICANCE LEVELS FOR DETECTING OUTLIERS IN LINEAR-MODELS [J].
COOK, RD ;
PRESCOTT, P .
TECHNOMETRICS, 1981, 23 (01) :59-63
[8]   DETECTION OF INFLUENTIAL OBSERVATION IN LINEAR-REGRESSION [J].
COOK, RD .
TECHNOMETRICS, 1977, 19 (01) :15-18
[9]   INFLUENTIAL OBSERVATIONS IN LINEAR-REGRESSION [J].
COOK, RD .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 1979, 74 (365) :169-174
[10]  
Daniel C., 1971, FITTING EQUATIONS DA