Model-based cluster and discriminant analysis with the MIXMOD software

被引:106
作者
Biernacki, Christophe [1 ]
Celeux, Gilles
Govaert, Gerard
Langrognet, Florent
机构
[1] CNRS, UMR 8524, F-59655 Villeneuve Dascq, France
[2] Univ Lille 1, F-59655 Villeneuve Dascq, France
[3] INRIA Futurs, F-91405 Orsay, France
[4] Univ Technol Compiegne, F-60205 Compiegne, France
[5] CNRS, UMR 6599, F-60205 Compiegne, France
[6] Univ Franche Comte, F-25030 Besancon, France
[7] CNRS, UMR 6623, F-25030 Besancon, France
关键词
Gaussian models; EM-like algorithms; model selection;
D O I
10.1016/j.csda.2005.12.015
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
The Mixture Modeling (MIXMOD) program fits mixture models to a given data set for the purposes of density estimation, clustering or discriminant analysis. A large variety of algorithms to estimate the mixture parameters are proposed (EM, Classification EM, Stochastic EM), and it is possible to combine these to yield different strategies for obtaining a sensible maximum for the likelihood (or complete-data likelihood) function. MIXMOD is currently intended to be used for multivariate Gaussian mixtures, and fourteen different Gaussian models can be distinguished according to different assumptions regarding the component variance matrix eigenvalue decomposition. Moreover, different information criteria for choosing a parsimonious model (the number of mixture components, for instance) are included, their suitability depending on the particular perspective (cluster analysis or discriminant analysis). Written in C++, MIXMOD is interfaced with SCILAB and MATLAB. The program, the statistical documentation and the user guide are available on the internet at the following address: http://www-math.univ-fcomte.fr/mixmod/index.php. (c) 2006 Elsevier B.V. All rights reserved.
引用
收藏
页码:587 / 600
页数:14
相关论文
共 25 条
[1]  
[Anonymous], 1985, Computational Statistics Quarterly, DOI DOI 10.1155/2010/874592
[2]  
[Anonymous], EM ALGORITHM
[3]   MODEL-BASED GAUSSIAN AND NON-GAUSSIAN CLUSTERING [J].
BANFIELD, JD ;
RAFTERY, AE .
BIOMETRICS, 1993, 49 (03) :803-821
[4]   Regularized Gaussian discriminant analysis through eigenvalue decomposition [J].
Bensmail, H ;
Celeux, G .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 1996, 91 (436) :1743-1748
[5]   Assessing a mixture model for clustering with the integrated completed likelihood [J].
Biernacki, C ;
Celeux, G ;
Govaert, G .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2000, 22 (07) :719-725
[6]   Choosing starting values for the EM algorithm for getting the highest likelihood in multivariate Gaussian mixture models [J].
Biernacki, C ;
Celeux, G ;
Govaert, G .
COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2003, 41 (3-4) :561-575
[7]   A generalized discriminant rule when training population and test population differ on their descriptive parameters [J].
Biernacki, C ;
Beninel, F ;
Bretagnolle, V .
BIOMETRICS, 2002, 58 (02) :387-397
[8]   Choosing models in model-based clustering and discriminant analysis [J].
Biernacki, C ;
Govaert, G .
JOURNAL OF STATISTICAL COMPUTATION AND SIMULATION, 1999, 64 (01) :49-71
[9]   An improvement of the NEC criterion for assessing the number of clusters in a mixture model [J].
Biernacki, C ;
Celeux, G ;
Govaert, G .
PATTERN RECOGNITION LETTERS, 1999, 20 (03) :267-272
[10]  
Bozdogan H., 1993, Information and classification: Concepts, methods and applications, P40, DOI DOI 10.1007/978-3-642-50974-2_5