Robust principal components regression based on principal sensitivity vectors

被引:25
作者
Zhang, MH [1 ]
Xu, QS [1 ]
Massart, DL [1 ]
机构
[1] Free Univ Brussels, Inst Pharmaceut, ChemoAC, B-1090 Brussels, Belgium
关键词
RPPSV; outlier; robust principal components regression; principal sensitivity vectors;
D O I
10.1016/S0169-7439(03)00095-9
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
A robust method called robust principal components regression based on principal sensitivity vectors (RPPSV) is developed for outlier detection in regression. The method is evaluated by its outlier detection ability and the root mean square error of prediction (RMSEP) for a test set using simulated data sets based on a real green tea data set. The results are compared with those obtained from several robust outlier diagnostic methods. It shows that when the data set is lowly contaminated, the RPPSV has good outlier detection ability, especially for bad leverage points, and its RMSEP value is comparable to the other selected methods. When the data set is highly contaminated, the RPPSV has the best outlier detection ability and the lowest RMSEP. (C) 2003 Elsevier B.V. All rights reserved.
引用
收藏
页码:175 / 185
页数:11
相关论文
共 16 条
[1]  
[Anonymous], 1989, MULTIVARIATE CALIBRA
[2]   ROBUST ESTIMATES, RESIDUALS, AND OUTLIER DETECTION WITH MULTIRESPONSE DATA [J].
GNANADESIKAN, R ;
KETTENRING, JR .
BIOMETRICS, 1972, 28 (01) :81-+
[3]   A fast method for robust principal components with applications to chemometrics [J].
Hubert, M ;
Rousseeuw, PJ ;
Verboven, S .
CHEMOMETRICS AND INTELLIGENT LABORATORY SYSTEMS, 2002, 60 (1-2) :101-111
[4]  
HUBERT M, 2002, UNPUB
[5]   PROJECTION-PURSUIT APPROACH TO ROBUST DISPERSION MATRICES AND PRINCIPAL COMPONENTS - PRIMARY THEORY AND MONTE-CARLO [J].
LI, GY ;
CHEN, ZL .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 1985, 80 (391) :759-766
[6]   Robust methods for multivariate analysis - A tutorial review [J].
Liang, YZ ;
Kvalheim, OM .
CHEMOMETRICS AND INTELLIGENT LABORATORY SYSTEMS, 1996, 32 (01) :1-10
[7]   Feasibility study for the use of near infrared spectroscopy in the qualitative and quantitative analysis of green tea, Camellia sinensis (L.) [J].
Luypaert, J ;
Zhang, MH ;
Massart, DL .
ANALYTICA CHIMICA ACTA, 2003, 478 (02) :303-312
[8]  
Massart D.L., 1997, HDB CHEMOMETRICS QUA, DOI 10.1021/ci980427d
[9]   LEAST MEDIAN OF SQUARES - A ROBUST METHOD FOR OUTLIER AND MODEL ERROR-DETECTION IN REGRESSION AND CALIBRATION [J].
MASSART, DL ;
KAUFMAN, L ;
ROUSSEEUW, PJ ;
LEROY, A .
ANALYTICA CHIMICA ACTA, 1986, 187 :171-179
[10]   A fast procedure for outlier diagnostics in large regression problems [J].
Peña, D ;
Yohai, V .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 1999, 94 (446) :434-445