SAMPLE-SIZE REQUIREMENTS FOR MULTIPLE OUTLIER LOCATION TECHNIQUES BASED ON ELEMENTAL SETS

被引:2
作者
BRADU, D
HAWKINS, DM
机构
[1] UNIV S AFRICA,DEPT STAT,PRETORIA 0001,SOUTH AFRICA
[2] UNIV MINNESOTA,DEPT APPL STAT,ST PAUL,MN 55108
关键词
REGRESSION OUTLIER; ELEMENTAL SET REGRESSION; LEAST MEDIAN OF SQUARES; PROBABILITY OF SUCCESS;
D O I
10.1016/0167-9473(93)90128-G
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
The identification of multiple outliers in regression data can be attained via robust regression. A class of robust regression techniques relies on investigation of the different elemental regressions - those determined by a minimal set of p observations, where p is the number of linear regression coefficients. exhaustive enumeration of all such elemental regressions is possible only in small problems. For larger data sets standard practice is to take a sample of N elemental regressions and for each one of the value of a selection statistic is obtained. The minimal value of the statistic points out the elemental regression which is the output of the method. Rousseeuw's LMS (Least Median of Squares) selection statistic defines a method which has a maximal breakdown point, is computationally manageable, and so has become popular. The performance of such a technique is usually demonstrated by means of examples, where there is success, but no quantitative evaluation is made. In this paper, the performance of a technique for a data set of known structure is evaluated by calculating the provability of success as a function of N, the number of elemental sets drawn. Success means an output of the procedure which is helpful in locating the outliers. The performance of Rousseeuw's LMS is analyzed in detail for ' two known data sets.
引用
收藏
页码:257 / 270
页数:14
相关论文
共 6 条
[1]  
[Anonymous], 2003, ROBUST REGRESSION OU
[2]  
BRADU D, 1982, TECHNOMETRICS, V24, P103
[3]  
DRAPER N, 1966, APPLIED REGRESSION A
[4]   LOCATION OF SEVERAL OUTLIERS IN MULTIPLE-REGRESSION DATA USING ELEMENTAL SETS [J].
HAWKINS, DM ;
BRADU, D ;
KASS, GV .
TECHNOMETRICS, 1984, 26 (03) :197-208
[5]  
PORTNOY S, 1987, STATISTICAL DATA ANA
[6]   LEAST MEDIAN OF SQUARES REGRESSION [J].
ROUSSEEUW, PJ .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 1984, 79 (388) :871-880