Probabilistic principal component analysis

被引:2149
作者
Tipping, ME [1 ]
Bishop, CM [1 ]
机构
[1] Microsoft Res, Cambridge CB2 3NH, England
关键词
density estimation; EM algorithm; Gaussian mixtures; maximum likelihood; principal component analysis; probability model;
D O I
10.1111/1467-9868.00196
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
Principal component analysis (PCA) is a ubiquitous technique for data analysis and processing, but one which is not based on a probability model. We demonstrate how the principal axes of a set of observed data vectors may be determined through maximum likelihood estimation of parameters in a latent variable model that is closely related to factor analysis. We consider the properties of the associated likelihood function, giving an EM algorithm for estimating the principal subspace iteratively, and discuss, with illustrative examples, the advantages conveyed by this probabilistic approach to PCA.
引用
收藏
页码:611 / 622
页数:12
相关论文
共 17 条