Mixtures of robust probabilistic principal component analyzers

被引:45
作者
Archambeau, Cedric [1 ]
Delannay, Nicolas [2 ]
Verleysen, Michel [2 ]
机构
[1] UCL, Ctr Computat Stat & Machine Learning, London WC1E 6BT, England
[2] Catholic Univ Louvain, Machine Learning Grp, B-1348 Louvain, Belgium
关键词
mixture model; principal component analysis; dimensionality reduction; robustness to outliers; non-Gaussianity; EM algorithm;
D O I
10.1016/j.neucom.2007.11.029
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Mixtures of probabilistic principal component analyzers model high-dimensional nonlinear data by combining local linear models. Each mixture component is specifically designed to extract the local principal orientations in the data. An important issue with this generative model is its sensitivity to data lying off the low-dimensional manifold. In order to address this problem, the mixtures of robust probabilistic principal component analyzers are introduced. They take care of atypical points by means of a long tail distribution, the Student-t. It is shown that the resulting mixture model is an extension of the mixture of Gaussians, suitable for both robust clustering and dimensionality reduction. Finally, we briefly discuss how to construct a robust version of the closely related mixture of factor analyzers. (c) 2008 Elsevier B.V. All rights reserved.
引用
收藏
页码:1274 / 1282
页数:9
相关论文
共 28 条
[1]  
[Anonymous], ADV NEURAL INFORM PR
[2]  
ARCHAMBEAU C, 2007, 15 EUR S ART NEUR NE, P229
[3]  
ARCHAMBEAU C, 2006, 23 INT C MACH LEARN, P33
[4]  
ARCHAMBEAU C, 2005, THESIS U CATHOLIQUE
[5]  
Bishop C. M., 2006, Pattern recognition and machine learning, V128, DOI DOI 10.1117/1.2819119
[6]  
Cox T., 2001, MULTIDIMENSIONAL SCA
[7]   MAXIMUM LIKELIHOOD FROM INCOMPLETE DATA VIA EM ALGORITHM [J].
DEMPSTER, AP ;
LAIRD, NM ;
RUBIN, DB .
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-METHODOLOGICAL, 1977, 39 (01) :1-38
[8]  
EVERITT BS, 1983, INTRO LATENT VARIABL
[9]  
Fisher R. A., 1925, METRON, V5, P90
[10]  
GLIAHRAMANI Z, 1996, CRGTR961 U TOR DEP C