Inference in model-based cluster analysis

被引:125
作者
Bensmail, H
Celeux, G
Raftery, AE
Robert, CP
机构
[1] UNIV WASHINGTON, DEPT STAT, SEATTLE, WA 98195 USA
[2] INRIA RHONE ALPES, ZIRST, F-38330 MONTBONNET ST MAR, FRANCE
[3] INSEE, CREST, F-92245 MALAKOFF, FRANCE
关键词
Bayes factor; eigenvalue decomposition; Gaussian mixture; Gibbs sampler;
D O I
10.1023/A:1018510926151
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
A new approach to cluster analysis has been introduced based on parsimonious geometric modelling of the within-group covariance matrices in a mixture of multivariate normal distributions, using hierarchical agglomeration and iterative relocation. It works well and is widely used via the MCLUST software available in S-PLUS and StatLib. However, it has several limitations: there is no assessment of the uncertainty about the classification, the partition can be suboptimal, parameter estimates are biased, the shape matrix has to be specified by the user, prior group probabilities are assumed to be equal, the method for choosing the number of groups is based on a crude approximation, and no formal way of choosing between the various possible models is included. Here, we propose a new approach which overcomes all these difficulties. It consists of exact Bayesian inference via Gibbs sampling, and the calculation of Bayes factors (for choosing the model and the number of groups) from the output using the Laplace-Metropolis estimator. It works well in several real and simulated examples.
引用
收藏
页码:1 / 10
页数:10
相关论文
共 22 条
[1]  
[Anonymous], 1993, REV MODULAD
[2]  
[Anonymous], PRACTICAL MARKOV CHA
[3]   MODEL-BASED GAUSSIAN AND NON-GAUSSIAN CLUSTERING [J].
BANFIELD, JD ;
RAFTERY, AE .
BIOMETRICS, 1993, 49 (03) :803-821
[4]   GAUSSIAN PARSIMONIOUS CLUSTERING MODELS [J].
CELEUX, G ;
GOVAERT, G .
PATTERN RECOGNITION, 1995, 28 (05) :781-793
[5]  
DIEBOLT J, 1994, J ROY STAT SOC B MET, V56, P363
[6]   BAYESIAN STATISTICAL-INFERENCE FOR PSYCHOLOGICAL-RESEARCH [J].
EDWARDS, W ;
LINDMAN, H ;
SAVAGE, LJ .
PSYCHOLOGICAL REVIEW, 1963, 70 (03) :193-242
[7]   BAYES FACTORS [J].
KASS, RE ;
RAFTERY, AE .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 1995, 90 (430) :773-795
[8]   A BAYESIAN METHOD FOR CLASSIFICATION AND DISCRIMINATION [J].
LAVINE, M ;
WEST, M .
CANADIAN JOURNAL OF STATISTICS-REVUE CANADIENNE DE STATISTIQUE, 1992, 20 (04) :451-461
[9]  
LEWIS SM, 1997, IN PRESS J AM STAT A
[10]   SEPARATING MIXTURES OF NORMAL DISTRIBUTIONS [J].
MARRIOTT, FHC .
BIOMETRICS, 1975, 31 (03) :767-769