Improved feasible solution algorithms for high breakdown estimation

被引:54
作者
Hawkins, DM [1 ]
Olive, DJ [1 ]
机构
[1] Univ Minnesota, Dept Appl Stat, St Paul, MN 55108 USA
基金
美国国家科学基金会;
关键词
linear model; outliers; high breakdown estimation; least trimmed squares; minimum volume ellipsoid; minimum covariance determinant;
D O I
10.1016/S0167-9473(98)00082-6
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
High breakdown estimation allows one to get reasonable estimates of the parameters from a sample of data even if that sample is contaminated by large numbers of awkwardly placed outliers. Two particular application areas in which this is of interest are multiple linear regression, and estimation of the location vector and scatter matrix of multivariate data. Standard high breakdown criteria for the regression problem are the least median of squares (LMS) and least trimmed squares (LTS); those for the multivariate location/scatter problem are the minimum volume ellipsoid (MVE) and minimum covariance determinant (MCD). All of these present daunting computational problems. The 'feasible solution algorithms' for these criteria have been shown to have excellent performance for text-book sized problems, but their performance on much larger data sets is less impressive. This paper points out a computationally cheaper feasibility condition for LTS, MVE and MCD, and shows how the combination of the criteria leads to improved performance on large data sets. Algorithms incorporating these improvements are available from the first author's Web site. (C) 1999 Elsevier Science B.V. All rights reserved.
引用
收藏
页码:1 / 11
页数:11
相关论文
共 19 条
[1]  
Agullo J, 1997, INST MATH S, V31, P133
[2]  
AGULLO J, 1996, P COMP STAT, P175
[3]  
[Anonymous], J COMPUTATIONAL GRAP
[4]  
COOK RD, 1990, J AM STAT ASSOC, V85, P640, DOI 10.2307/2289996
[5]   EXACT ITERATIVE COMPUTATION OF THE ROBUST MULTIVARIATE MINIMUM VOLUME ELLIPSOID ESTIMATOR [J].
COOK, RD ;
HAWKINS, DM ;
WEISBERG, S .
STATISTICS & PROBABILITY LETTERS, 1993, 16 (03) :213-218
[6]   An easy way to increase the finite-sample efficiency of the resampled minimum volume ellipsoid estimator [J].
Croux, C ;
Haesbroeck, G .
COMPUTATIONAL STATISTICS & DATA ANALYSIS, 1997, 25 (02) :125-141
[7]  
Grubel R., 1988, METRIKA, V35, P49
[8]   HEDONIC HOUSING PRICES AND DEMAND FOR CLEAN-AIR [J].
HARRISON, D ;
RUBINFELD, DL .
JOURNAL OF ENVIRONMENTAL ECONOMICS AND MANAGEMENT, 1978, 5 (01) :81-102
[9]   THE FEASIBLE SET ALGORITHM FOR LEAST MEDIAN OF SQUARES REGRESSION [J].
HAWKINS, DM .
COMPUTATIONAL STATISTICS & DATA ANALYSIS, 1993, 16 (01) :81-101
[10]   THE FEASIBLE SOLUTION ALGORITHM FOR LEAST TRIMMED SQUARES REGRESSION [J].
HAWKINS, DM .
COMPUTATIONAL STATISTICS & DATA ANALYSIS, 1994, 17 (02) :185-196