THE FEASIBLE SOLUTION ALGORITHM FOR LEAST TRIMMED SQUARES REGRESSION

被引:50
作者
HAWKINS, DM [1 ]
机构
[1] UNIV MINNESOTA,DEPT APPL STAT,ST PAUL,MN 55108
基金
美国国家科学基金会;
关键词
LINEAR MODEL; OUTLIERS; HIGH BREAKDOWN REGRESSION;
D O I
10.1016/0167-9473(92)00070-8
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Least trimmed squares (LTS) is a criterion for analyzing multiple regression data sets in which there may be outliers. The method consists of finding that subset of cases whose deletion from the data set would lead to the regression with the smallest residual sum of squares. It is used as a general-purpose high breakdown method, and also has some inferential motivation in that it gives the maximum likelihood estimator of the regression under a widely-used outlier model. It is also theoretically attractive in that it has standard O(n-1/2) asymptotics. Its practical usefulness has however been limited by the absence of a computationally manageable exact algorithm for performing LTS fits. This paper capitalizes on a necessary condition characterizing the LTS fit to develop a probabilistic 'feasible solution' algorithm. This algorithm takes random starting trial solutions and refines each to the local optimum satisfying this necessary condition. Repeating this using different starting sets provides the global optimum with arbitrarily high probability for sufficiently many random starts. Exhaustive enumeration of several standard data sets from the literature verifies the method's good performance. An example of a very large data set shows that the usefulness of the method is not confined to the small samples often used in the robustness and outlier literature.
引用
收藏
页码:185 / 196
页数:12
相关论文
共 11 条
[1]  
[Anonymous], 2003, ROBUST REGRESSION OU
[2]  
ATKINSON AC, 1991, DIRECTIONS ROBUST 1, P7
[3]   DETECTING OUTLIERS .2. SUPPLEMENTING DIRECT ANALYSIS OF RESIDUALS [J].
GENTLEMAN, JF ;
WILK, MB .
BIOMETRICS, 1975, 31 (02) :387-410
[4]   HEDONIC HOUSING PRICES AND DEMAND FOR CLEAN-AIR [J].
HARRISON, D ;
RUBINFELD, DL .
JOURNAL OF ENVIRONMENTAL ECONOMICS AND MANAGEMENT, 1978, 5 (01) :81-102
[5]   LOCATION OF SEVERAL OUTLIERS IN MULTIPLE-REGRESSION DATA USING ELEMENTAL SETS [J].
HAWKINS, DM ;
BRADU, D ;
KASS, GV .
TECHNOMETRICS, 1984, 26 (03) :197-208
[7]  
Hawkins Douglas M, 1980, IDENTIFICATION OUTLI, DOI [DOI 10.1007/978-94-015-3994-4, 10.1007/978-94-015-3994-4]
[8]  
MARAZZI A, 1991, DIRECTIONS ROBUST 1, P183
[9]  
MASON RL, 1989, STATISTICAL DESIGN A
[10]   LEAST MEDIAN OF SQUARES REGRESSION [J].
ROUSSEEUW, PJ .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 1984, 79 (388) :871-880