Partial robust M-regression

被引:152
作者
Serneels, S
Croux, C
Filzmoser, P
Van Espen, PJ
机构
[1] Univ Antwerp, Dept Chem, B-2020 Antwerp, Belgium
[2] Katholieke Univ Leuven, Dept Appl Econ, Louvain, Belgium
[3] Vienna Univ Technol, Dept Stat & Probabil Theory, A-1060 Vienna, Austria
关键词
calibration; partial least squares; M-estimators; prediction; outliers; robustness; spectrometric quantization;
D O I
10.1016/j.chemolab.2005.04.007
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Partial Least Squares (PLS) is a standard statistical method in chemometrics. It can be considered as an incomplete, or "partial", version of the Least Squares estimator of regression, applicable when high or perfect multicollinearity is present in the predictor variables. The Least Squares estimator is well-known to be an optimal estimator for regression, but only when the error terms are normally distributed. In the absence of normality, and in particular when outliers are in the data set, other more robust regression estimators have better properties. In this paper a "partial" version of M-regression estimators will be defined. If an appropriate weighting scheme is chosen, partial M-estimators become entirely robust to any type of outlying points, and are called Partial Robust M-estimators. It is shown that partial robust M-regression outperforms existing methods for robust PLS regression in terms of statistical precision and computational speed, while keeping good robustness properties. The method is applied to a data set consisting of EPXMA spectra of archaeological glass vessels. This data set contains several outliers, and the advantages of partial robust M-regression are illustrated. Applying partial robust M-regression yields much smaller prediction errors for noisy calibration samples than PLS. On the other hand, if the data follow perfectly well a normal model, the loss in efficiency to be paid for is very small. (c) 2005 Elsevier B.V. All rights reserved.
引用
收藏
页码:55 / 64
页数:10
相关论文
共 16 条
[1]  
[Anonymous], 1987, ROBUST REGRESSION OU
[2]   Influence function and efficiency of the minimum covariance determinant scatter matrix estimator [J].
Croux, C ;
Haesbroeck, G .
JOURNAL OF MULTIVARIATE ANALYSIS, 1999, 71 (02) :161-190
[3]   Principal component analysis based on robust estimators of the covariance or correlation matrix: Influence functions and efficiencies [J].
Croux, C ;
Haesbroeck, G .
BIOMETRIKA, 2000, 87 (03) :603-618
[4]  
CROUX C, IN PRESS J MULTIVAR
[5]   ITERATIVELY REWEIGHTED PARTIAL LEAST-SQUARES - A PERFORMANCE ANALYSIS BY MONTE-CARLO SIMULATION [J].
CUMMINS, DJ ;
ANDREWS, CW .
JOURNAL OF CHEMOMETRICS, 1995, 9 (06) :489-507
[6]   SIMPLS - AN ALTERNATIVE APPROACH TO PARTIAL LEAST-SQUARES REGRESSION [J].
DEJONG, S .
CHEMOMETRICS AND INTELLIGENT LABORATORY SYSTEMS, 1993, 18 (03) :251-263
[7]  
DEMPSTER AP, 1980, MULTIVARIATE ANAL, V5, P35
[8]   COMPARISON OF SEMIROBUST AND ROBUST PARTIAL LEAST-SQUARES PROCEDURES [J].
GRIEP, MI ;
WAKELING, IN ;
VANKEERBERGHEN, P ;
MASSART, DL .
CHEMOMETRICS AND INTELLIGENT LABORATORY SYSTEMS, 1995, 29 (01) :37-50
[9]  
Hossjer 0., 1995, NONPARAMETRIC STAT, V4, P293
[10]  
Huber P. J., 1981, ROBUST STAT