An EM algorithm for the block mixture model

被引:59
作者
Govaert, G
Nadif, M
机构
[1] Univ Technol Compiegne, CNRS, UMR 6599, HEUDIASYC, F-60205 Compiegne, France
[2] Univ Metz, Inst Univ Technol, F-57045 Metz, France
关键词
block mixture model; EM algorithm; variational approximation;
D O I
10.1109/TPAMI.2005.69
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Although many clustering procedures aim to construct an optimal partition of objects or, sometimes, of variables, there are other methods, called block clustering methods, which consider simultaneously the two sets and organize the data into homogeneous blocks. Recently, we have proposed a new mixture model called block mixture model which takes into account this situation. This model allows one to embed simultaneous clustering of objects and variables in a mixture approach. We have studied this probabilistic model under the classification likelihood approach and developed a new algorithm for simultaneous partitioning based on the Classification EM algorithm. In this paper, we consider the block clustering problem under the maximum likelihood approach and the goal of our contribution is to estimate the parameters of this model. Unfortunately, the application of the EM algorithm for the block mixture model cannot be made directly; difficulties arise due to the dependence structure in the model and approximations are required. Using a variational approximation, we propose a generalized EM algorithm to estimate the parameters of the block mixture model and, to illustrate our approach, we study the case of binary data by using a Bernoulli block mixture.
引用
收藏
页码:643 / 647
页数:5
相关论文
共 15 条
[1]  
BOCK H, 1979, ANAL DONNEES INFORM, P187
[2]   A CLASSIFICATION EM ALGORITHM FOR CLUSTERING AND 2 STOCHASTIC VERSIONS [J].
CELEUX, G ;
GOVAERT, G .
COMPUTATIONAL STATISTICS & DATA ANALYSIS, 1992, 14 (03) :315-332
[3]   Effects of polysilicon gate doping concentration on plasma charging damage in ultrathin gate oxides [J].
Chen, CC ;
Lin, HC ;
Chang, CY ;
Huang, TY ;
Chien, CH ;
Liang, MS .
ELECTROCHEMICAL AND SOLID STATE LETTERS, 2000, 3 (02) :103-105
[4]   MAXIMUM LIKELIHOOD FROM INCOMPLETE DATA VIA EM ALGORITHM [J].
DEMPSTER, AP ;
LAIRD, NM ;
RUBIN, DB .
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-METHODOLOGICAL, 1977, 39 (01) :1-38
[5]  
Dhillon I. S., 2001, P 7 ACM SIGKDD INT C, P269, DOI DOI 10.1145/502512.502550
[6]   Comparison of the mixture and the classification maximum likelihood in cluster analysis with binary data [J].
Govaert, G ;
Nadif, M .
COMPUTATIONAL STATISTICS & DATA ANALYSIS, 1996, 23 (01) :65-81
[7]  
Govaert G., 1995, Control and Cybernetics, V24, P437
[8]  
Govaert G, 2003, PATTERN RECOGN, V36, P463, DOI 10.1016/S0031-3203(02)00074-2
[9]  
Govaert G., 1983, THESIS U PARIS 6 FRA
[10]  
Hartigan J. A., 1975, CLUSTERING ALGORITHM