Unsupervised learning of finite mixture models

被引:1487
作者
Figueiredo, MAT [1 ]
Jain, AK
机构
[1] Inst Super Tecn, Inst Telecommun, P-1049001 Lisbon, Portugal
[2] Inst Super Tecn, Dept Elect & Comp Engn, P-1049001 Lisbon, Portugal
[3] Michigan State Univ, Dept Comp Sci & Engn, E Lansing, MI 48824 USA
关键词
finite mixtures; unsupervised learning; model selection; minimum message length criterion; Bayesian methods; expectation-maximization algorithm; clustering;
D O I
10.1109/34.990138
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper proposes an unsupervised algorithm for learning a finite mixture model from multivariate data. The adjective "unsupervised" is justified by two properties of the algorithm: 1) it is capable of selecting the number of components and 2) unlike the standard expectation-maximization (EM) algorithm, it does not require careful initialization. The proposed method also avoids another drawback of EM for mixture fitting: the possibility of convergence toward a singular estimate at the boundary of the parameter space. The novelty of our approach is that we do not use a model selection criterion to choose one among a set of preestimated candidate models; instead, we seamlessly integrate estimation and model selection in a single algorithm. Our technique can be applied to any type of parametric mixture model for which it is possible to write an EM algorithm; in this paper, we illustrate it with experiments involving Gaussian mixtures. These experiments testify for the good performance of our approach.
引用
收藏
页码:381 / 396
页数:16
相关论文
共 64 条
[1]  
[Anonymous], 329 U WASH DEP STAT, DOI DOI 10.1093/oep/gpr002
[2]   MODEL-BASED GAUSSIAN AND NON-GAUSSIAN CLUSTERING [J].
BANFIELD, JD ;
RAFTERY, AE .
BIOMETRICS, 1993, 49 (03) :803-821
[3]   Inference in model-based cluster analysis [J].
Bensmail, H ;
Celeux, G ;
Raftery, AE ;
Robert, CP .
STATISTICS AND COMPUTING, 1997, 7 (01) :1-10
[4]  
Bernardo J.M., 2009, Bayesian Theory, V405
[5]  
Bertsekas D. P., 1999, NONLINEAR PROGRAMMIN, V2nd
[6]   Assessing a mixture model for clustering with the integrated completed likelihood [J].
Biernacki, C ;
Celeux, G ;
Govaert, G .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2000, 22 (07) :719-725
[7]   An improvement of the NEC criterion for assessing the number of clusters in a mixture model [J].
Biernacki, C ;
Celeux, G ;
Govaert, G .
PATTERN RECOGNITION LETTERS, 1999, 20 (03) :267-272
[8]  
Biernacki C, 1997, COMPUTING SCI STAT, V29, P451
[9]  
Bozdogan H., 1993, Information and classification: Concepts, methods and applications, P40, DOI DOI 10.1007/978-3-642-50974-2_5
[10]   Structure learning in conditional probability models via an entropic prior and parameter extinction [J].
Brand, M .
NEURAL COMPUTATION, 1999, 11 (05) :1155-1182