Comparative analysis on convergence rates of the EM algorithm and its two modifications for Gaussian mixtures

被引:10
作者
Xu, L [1 ]
机构
[1] CHINESE UNIV HONG KONG,DEPT COMP SCI & ENGN,SHATIN,HONG KONG
关键词
convergence rate; EM algorithm and variants; Gaussian mixture; maximum likelihood; momentum;
D O I
10.1023/A:1009627306313
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
For Gaussian mixture, a comparative analysis has been made on the convergence rate by the Expectation-Maximization (EM) algorithm and its two types of modifications. One is a variant of the EM algorithm (denoted by VEM) which uses the old value of mean vectors instead of the latest updated one in the current updating of the covariance matrices. The other is obtained by adding a momentum term in the EM updating equation, called the Momentum EM algorithm (MEM). Their up-bound convergence rates have been obtained, including an extension and a modification of those given in Xu & Jordan (1996). It has been shown that the EM algorithm and VEM are equivalent in their local convergence and rates, and that the MEM can speed up the convergence of the EM algorithm if a suitable amount of momentum is added. Moreover, a theoretical guide on how to add momentum is proposed, and a possible approach for further speeding up the convergence is suggested.
引用
收藏
页码:69 / 76
页数:8
相关论文
共 7 条
[1]   MAXIMUM LIKELIHOOD FROM INCOMPLETE DATA VIA EM ALGORITHM [J].
DEMPSTER, AP ;
LAIRD, NM ;
RUBIN, DB .
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-METHODOLOGICAL, 1977, 39 (01) :1-38
[2]  
MA J, 1996, UNPUB JOURNAL
[3]  
MEILIJSON I, 1989, J ROY STAT SOC B MET, V51, P127
[4]   ON THE RATE OF CONVERGENCE OF THE ECM ALGORITHM [J].
MENG, XL .
ANNALS OF STATISTICS, 1994, 22 (01) :326-339
[5]  
PETERS BC, 1978, SIAM J APPL MATH, V35, P362
[6]   MIXTURE DENSITIES, MAXIMUM-LIKELIHOOD AND THE EM ALGORITHM [J].
REDNER, RA ;
WALKER, HF .
SIAM REVIEW, 1984, 26 (02) :195-237
[7]   On convergence properties of the EM algorithm for gaussian mixtures [J].
Xu, L ;
Jordan, MI .
NEURAL COMPUTATION, 1996, 8 (01) :129-151