High breakdown estimators for principal components: the projection-pursuit approach revisited

被引:186
作者
Croux, C
Ruiz-Gazen, A
机构
[1] Univ Toulouse 3, LSP, CNRS, UMR 5583, F-31062 Toulouse, France
[2] Katholieke Univ Leuven, Dept Appl Econ, B-3000 Louvain, Belgium
[3] Univ Toulouse 1, CNRS, UMR 5604, GREMAQ, F-31042 Toulouse, France
关键词
breakdown point; dispersion matrix; influence function; principal components analysis; projection-pursuit; robustness;
D O I
10.1016/j.jmva.2004.08.002
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
Li and Chen (J. Amer. Statist. Assoc. 80 (1985) 759) proposed a method for principal components using projection-pursuit techniques. In classical principal components one searches for directions with maximal variance, and their approach consists of replacing this variance by a robust scale measure. Li and Chen showed that this estimator is consistent, qualitative robust and inherits the breakdown point of the robust scale estimator. We complete their study by deriving the influence function of the estimators for the eigenvectors, eigenvalues and the associated dispersion matrix. Corresponding Gaussian efficiencies are presented as well. Asymptotic normality of the estimators has been treated in a paper of Cui et al. (Biometrika 90 (2003) 953), complementing the results of this paper. Furthermore, a simple explicit version of the projection-pursuit based estimator is proposed and shown to be fast to compute, orthogonally equivariant, and having the maximal finite-sample breakdown point property. We will illustrate the method with a real data example. (c) 2004 Elsevier Inc. All rights reserved.
引用
收藏
页码:206 / 226
页数:21
相关论文
共 37 条
[2]  
[Anonymous], 1987, ROBUST REGRESSION OU
[3]  
BESSE PH, 1993, COMPUTER INTENSIVE M
[4]   Influence functions and outlier detection under the common principal components model: A robust approach [J].
Boente, G ;
Pires, AM ;
Rodrigues, IM .
BIOMETRIKA, 2002, 89 (04) :861-875
[5]  
Boente G, 2001, TRENDS MATH, P117
[6]  
CRITCHLEY F, 1985, BIOMETRIKA, V72, P627, DOI 10.2307/2336735
[7]  
Croux C., 1996, COMPSTAT. Proceedings in Computational Statistics. 12th Symposium, P211
[8]   Principal component analysis based on robust estimators of the covariance or correlation matrix: Influence functions and efficiencies [J].
Croux, C ;
Haesbroeck, G .
BIOMETRIKA, 2000, 87 (03) :603-618
[9]   EFFICIENT HIGH-BREAKDOWN M-ESTIMATORS OF SCALE [J].
CROUX, C .
STATISTICS & PROBABILITY LETTERS, 1994, 19 (05) :371-379
[10]   GENERALIZED S-ESTIMATORS [J].
CROUX, C ;
ROUSSEEUW, PJ ;
HOSSJER, O .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 1994, 89 (428) :1271-1281