Robust kernel principal component analysis and classification

被引:3
作者
Michiel Debruyne
Tim Verdonck
机构
来源
Advances in Data Analysis and Classification | 2010年 / 4卷
关键词
Principal component analysis; Kernel methods; Classification; Robustness; 62H30; 62G35;
D O I
暂无
中图分类号
学科分类号
摘要
Kernel principal component analysis (KPCA) extends linear PCA from a real vector space to any high dimensional kernel feature space. The sensitivity of linear PCA to outliers is well-known and various robust alternatives have been proposed in the literature. For KPCA such robust versions received considerably less attention. In this article we present kernel versions of three robust PCA algorithms: spherical PCA, projection pursuit and ROBPCA. These robust KPCA algorithms are analyzed in a classification context applying discriminant analysis on the KPCA scores. The performances of the different robust KPCA algorithms are studied in a simulation study comparing misclassification percentages, both on clean and contaminated data. An outlier map is constructed to visualize outliers in such classification problems. A real life example from protein classification illustrates the usefulness of robust KPCA and its corresponding outlier map.
引用
收藏
页码:151 / 167
页数:16
相关论文
共 79 条
[1]  
Alzate C(2008)Kernel component analysis using an epsilon-insensitive robust loss function IEEE Trans Neural Netw 19 1583-1598
[2]  
Suykens JAK(2005)High breakdown estimators for principal components: the projection- pursuit approach revisited J Multivar Anal 95 206-226
[3]  
Croux C(2007)Algorithms for projection-pursuit robust principal component analysis Chemom Intell Lab Syst 87 218-225
[4]  
Ruiz-Gazen A(2003)Asymptotic distributions of principal components based on robust dispersions Biometrika 90 953-966
[5]  
Croux C(2009)An outlier map for support vector machine classification Ann Appl Stat 3 1566-1580
[6]  
Filzmoser P(2009)The influence function of the Stahel-Donoho covariance estimator of smallest outlyingness Stat Probab Lett 79 275-282
[7]  
Oliveira MR(2009)Robustified least squares support vector classification J Chemometrics 23 479-486
[8]  
Cui H(1992)Breakdown properties of location estimates based on half-space depth and projected outlyingness Ann Stat 20 1803-1827
[9]  
He X(1974)A projection pursuit algorithm for exploratory data analysis IEEE Trans Comput C-23 881-890
[10]  
Ng KW(1985)Projection pursuit Ann Stat 13 435-475