A greedy EM algorithm for Gaussian mixture learning

被引:230
作者
Vlassis, N
Likas, A
机构
[1] Univ Amsterdam, RWCP Autonomous Learning Funct SNN, NL-1098 SJ Amsterdam, Netherlands
[2] Univ Ioannina, Dept Comp Sci, GR-45110 Ioannina, Greece
关键词
EM algorithm; Gaussian mixture; greedy learning;
D O I
10.1023/A:1013844811137
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Learning a Gaussian mixture with a local algorithm like EM can be difficult because (i) the true number of mixing components is usually unknown, (ii) there is no generally accepted method for parameter initialization, and (iii) the algorithm can get trapped in one of the many local maxima of the likelihood function. In this paper we propose a greedy algorithm for learning a Gaussian mixture which tries to overcome these limitations. In particular, starting with a single component and adding components sequentially until a maximum number k, the algorithm is capable of achieving solutions superior to EM with k components in terms of the likelihood of a test set. The algorithm is based on recent theoretical results on incremental mixture density estimation, and uses a combination of global and local search each time a new component is added to the mixture.
引用
收藏
页码:77 / 87
页数:11
相关论文
共 19 条
[1]  
Blake C.L., 1998, UCI repository of machine learning databases
[2]   A REVIEW OF RELIABLE MAXIMUM-LIKELIHOOD ALGORITHMS FOR SEMIPARAMETRIC MIXTURE-MODELS [J].
BOHNING, D .
JOURNAL OF STATISTICAL PLANNING AND INFERENCE, 1995, 47 (1-2) :5-28
[3]  
DASGUPTA S, 1999, P IEEE S FDN COMP SC
[4]   MAXIMUM LIKELIHOOD FROM INCOMPLETE DATA VIA EM ALGORITHM [J].
DEMPSTER, AP ;
LAIRD, NM ;
RUBIN, DB .
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-METHODOLOGICAL, 1977, 39 (01) :1-38
[5]   MAXIMUM-LIKELIHOOD-ESTIMATION OF A MIXING DISTRIBUTION [J].
DERSIMONIAN, R .
APPLIED STATISTICS-JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES C, 1986, 35 (03) :302-309
[6]  
Ghahramani Z., 1997, CRGTR961 U TOR
[7]  
LI JQ, 2000, ADV NEURAL INFORMATI, V12
[8]   THE GEOMETRY OF MIXTURE LIKELIHOODS - A GENERAL-THEORY [J].
LINDSAY, BG .
ANNALS OF STATISTICS, 1983, 11 (01) :86-94
[9]  
McLachlan GJ, 2000, WILEY SER PROB STAT, DOI 10.1002/0471721182
[10]  
MOORE A, 1999, ADV NEURAL INFORMATI, V11