Comparison of the mixture and the classification maximum likelihood in cluster analysis with binary data

被引:20
作者
Govaert, G
Nadif, M
机构
[1] UTC,URA CNRS 817,F-60205 COMPIEGNE,FRANCE
[2] UNIV METZ,LRIM,F-57045 METZ,FRANCE
关键词
maximum likelihood; classification maximum likelihood; equal or unknown mixing proportions; simulation comparison;
D O I
10.1016/S0167-9473(96)00021-7
中图分类号
TP39 [计算机的应用];
学科分类号
081203 [计算机应用技术]; 0835 [软件工程];
摘要
In this paper we propose to extend the comparison between the maximum likelihood and the classification maximum likelihood approaches for the Gaussian mixture (Ganesalingam, 1989; Celeux and Govaert, 1993) in the case of binary data, To this end, we use Bernoulli distribution mixtures. As with continuous data, two situations are discussed: first where mixing proportions are taken to be equal and secondly where they are unknown. The comparison realized with Monte-Carlo numerical experiments confirms the results obtained with continuous data. The choice of the approach depends on the size of the sample but assumptions about the mixing proportions are more important than the choice between the two approaches.
引用
收藏
页码:65 / 81
页数:17
相关论文
共 12 条
[1]
CLUSTERING CRITERIA FOR DISCRETE-DATA AND LATENT CLASS MODELS [J].
CELEUX, G ;
GOVAERT, G .
JOURNAL OF CLASSIFICATION, 1991, 8 (02) :157-176
[2]
A CLASSIFICATION EM ALGORITHM FOR CLUSTERING AND 2 STOCHASTIC VERSIONS [J].
CELEUX, G ;
GOVAERT, G .
COMPUTATIONAL STATISTICS & DATA ANALYSIS, 1992, 14 (03) :315-332
[3]
Celeux G., 1993, J STAT COMPUT SIM, V47, P127, DOI DOI 10.1080/00949659308811525
[4]
DAY NE, 1969, BIOMETRIKA, V56, P464
[5]
MAXIMUM LIKELIHOOD FROM INCOMPLETE DATA VIA EM ALGORITHM [J].
DEMPSTER, AP ;
LAIRD, NM ;
RUBIN, DB .
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-METHODOLOGICAL, 1977, 39 (01) :1-38
[6]
CLASSIFICATION AND MIXTURE APPROACHES TO CLUSTERING VIA MAXIMUM-LIKELIHOOD [J].
GANESALINGAM, S .
APPLIED STATISTICS-JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES C, 1989, 38 (03) :455-466
[7]
GOVAERT G, 1990, REV STAT APPL, V38, P67
[8]
NADIF M, 1993, REV STAT APPL, V41, P55
[9]
Schroeder A., 1976, REV STATISTIQUES APP, V24, P39
[10]
CLUSTERING METHODS BASED ON LIKELIHOOD RATIO CRITERIA [J].
SCOTT, AJ ;
SYMONS, MJ .
BIOMETRICS, 1971, 27 (02) :387-&