Fast model selection for robust calibration methods

Cited by: 11
Authors
Engelen, S [1]
Hubert, M [1]
Affiliations
[1] Katholieke Univ Leuven, Dept Math, B-3001 Louvain, Belgium
Keywords
robustness; model selection; cross-validation; PCR; PLS
DOI
10.1016/j.aca.2005.01.015
Chinese Library Classification number
O65 [Analytical chemistry]
Discipline classification codes
070302; 081704
Abstract
One of the main issues in principal component regression (PCR) and partial least squares regression (PLSR) is the selection of the number of principal components. To this end, the curve of the root mean squared error of cross-validated prediction (RMSECV) is often described in the literature as a very helpful graphical tool. In this paper, we focus on model selection for robust calibration methods. We first propose a robust RMSECV value and then use it to define a new criterion for selecting the optimal number of components. This robust component selection (RCS) statistic combines the goodness-of-fit and the predictive power of the model. As the algorithms to compute these robust PCR and PLSR estimators are more complex and slower than the classical approaches, cross-validation becomes very time-consuming. Hence, we propose fast algorithms to compute the robust RMSECV values. We evaluate the developed procedures on several data sets. © 2005 Elsevier B.V. All rights reserved.
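To make the RMSECV curve mentioned in the abstract concrete, here is a minimal sketch, assuming classical (non-robust) PCR with leave-one-out cross-validation in NumPy. The function names `pcr_fit_predict` and `rmsecv_curve` and the `trim` parameter are hypothetical. Dropping the largest squared residuals via `trim` is only an illustrative stand-in for a robust variant; it is not the robust RMSECV, the RCS statistic, or the fast algorithms proposed in the paper.

```python
import numpy as np


def pcr_fit_predict(X_train, y_train, X_test, k):
    """Classical PCR: regress y on the scores of the first k principal components."""
    x_mean = X_train.mean(axis=0)
    y_mean = y_train.mean()
    Xc = X_train - x_mean
    # The loadings are the right singular vectors of the centered training data.
    _, _, Vt = np.linalg.svd(Xc, full_matrices=False)
    P = Vt[:k].T                      # (p, k) loading matrix
    T = Xc @ P                        # (n, k) score matrix
    coef, *_ = np.linalg.lstsq(T, y_train - y_mean, rcond=None)
    return (X_test - x_mean) @ P @ coef + y_mean


def rmsecv_curve(X, y, k_max, trim=0.0):
    """Leave-one-out RMSECV for k = 1..k_max components.

    trim is an assumed illustration, not the paper's definition: the fraction
    of largest squared cross-validated residuals left out of the average.
    """
    n = X.shape[0]
    h = n - int(np.floor(trim * n))   # number of residuals kept
    curve = []
    for k in range(1, k_max + 1):
        sq_res = np.empty(n)
        for i in range(n):
            keep = np.arange(n) != i  # leave observation i out
            pred = pcr_fit_predict(X[keep], y[keep], X[i:i + 1], k)
            sq_res[i] = (y[i] - pred[0]) ** 2
        curve.append(np.sqrt(np.sort(sq_res)[:h].mean()))
    return np.array(curve)
```

Plotting the returned values against k gives the kind of curve the abstract refers to. The inner loop refits the model n times for every k, which illustrates why cross-validation becomes expensive once the classical fit is replaced by a slower robust estimator.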
Pages: 219-228
Page count: 10