Singularity and Slow Convergence of the EM algorithm for Gaussian Mixtures

Cited by: 25
Authors
Park, Hyeyoung [1]
Ozeki, Tomoko [2]
Affiliations
[1] Kyungpook Natl Univ, Sch Elect Engn & Comp Sci, Taegu 702701, South Korea
[2] Tokai Univ, Dept Human & Informat Sci, Kanagawa 2591292, Japan
Funding
Japan Society for the Promotion of Science;
Keywords
EM algorithm; Gradient descent learning; Learning dynamics; Singularity; Slow convergence; NEURAL-NETWORK REGRESSION; SOFT COMMITTEE MACHINES; MULTILAYER PERCEPTRONS; DYNAMICS; MODELS;
DOI
10.1007/s11063-009-9094-4
Chinese Library Classification
TP18 [Artificial intelligence theory];
Subject classification codes
081104; 0812; 0835; 1405;
Abstract
Singularities in the parameter spaces of hierarchical learning machines are known to be a main cause of the slow convergence of gradient descent learning. The EM algorithm, another learning algorithm that yields a maximum likelihood estimator, also suffers from slow convergence, which often appears when the overlap between mixture components is large. We analyze the dynamics of the EM algorithm for Gaussian mixtures around singularities and show that there exists a slow manifold caused by the singular structure, which is closely related to the slow convergence of the EM algorithm. We also conduct numerical simulations to confirm the theoretical analysis. Through the simulations, we compare the dynamics of the EM algorithm with that of the gradient descent algorithm, and show that their slow dynamics are caused by the same singular structure and hence exhibit the same behavior around singularities.
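As an informal illustration of the phenomenon described in the abstract, the sketch below runs standard EM for a two-component one-dimensional Gaussian mixture whose components overlap heavily, starting near the singular region where the two components coincide; the data, initialization, and parameter names are illustrative assumptions, not the authors' experimental setup.

# A minimal sketch (not the paper's code): EM for a two-component 1-D Gaussian
# mixture with heavily overlapping components. With this setup the estimated
# means typically separate very slowly, which is the slow-convergence behavior
# the paper attributes to the singular structure of the parameter space.
import numpy as np

rng = np.random.default_rng(0)

# Data from two heavily overlapping components (illustrative assumption).
n = 2000
x = np.concatenate([rng.normal(-0.25, 1.0, n // 2),
                    rng.normal(+0.25, 1.0, n // 2)])

def normal_pdf(x, mu, var):
    # Density of N(mu, var) evaluated at x.
    return np.exp(-0.5 * (x - mu) ** 2 / var) / np.sqrt(2 * np.pi * var)

# Initialize near the singular subspace: nearly equal means, equal variances.
w = np.array([0.5, 0.5])
mu = np.array([-0.05, 0.05])
var = np.array([1.0, 1.0])

for t in range(500):
    # E-step: posterior responsibility of each component for each data point.
    dens = w[None, :] * normal_pdf(x[:, None], mu[None, :], var[None, :])
    resp = dens / dens.sum(axis=1, keepdims=True)

    # M-step: weighted maximum-likelihood updates of weights, means, variances.
    nk = resp.sum(axis=0)
    w = nk / n
    mu = (resp * x[:, None]).sum(axis=0) / nk
    var = (resp * (x[:, None] - mu[None, :]) ** 2).sum(axis=0) / nk

    if t % 100 == 0:
        # Log-likelihood at the parameters used in this E-step.
        loglik = np.log(dens.sum(axis=1)).sum()
        print(f"iter {t:4d}  log-lik {loglik:10.3f}  |mu1-mu2| {abs(mu[0] - mu[1]):.4f}")

Running the sketch with a larger separation between the true means (e.g. -1.0 and +1.0) makes the contrast visible: EM then converges in a handful of iterations, whereas in the overlapping case the mean gap shrinks extremely slowly, consistent with the slow-manifold picture analyzed in the paper.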
Pages: 45-59
Number of pages: 15