Bayesian regularization for normal mixture estimation and model-based clustering

Cited by: 13
Authors
Fraley, Chris [1]
Raftery, Adrian E. [1]
Affiliation
[1] Univ Washington, Dept Stat, Seattle, WA 98195 USA
Keywords
BIC; EM algorithm; mixture models; model-based clustering; conjugate prior; posterior mode
DOI
10.1007/s00357-007-0004-5
Chinese Library Classification
O1 [Mathematics]
Subject classification code
0701 [Mathematics]; 070101 [Fundamental Mathematics]
Abstract
Normal mixture models are widely used for statistical modeling of data, including cluster analysis. However, maximum likelihood estimation (MLE) for normal mixtures using the EM algorithm may fail as a result of singularities or degeneracies. To avoid this, we propose replacing the MLE with a maximum a posteriori (MAP) estimator, also found by the EM algorithm. For choosing the number of components and the model parameterization, we propose a modified version of BIC, in which the likelihood is evaluated at the MAP instead of the MLE. We use a highly dispersed proper conjugate prior, containing a small fraction of one observation's worth of information. The resulting method avoids degeneracies and singularities, but when these are not present it gives results similar to those of the standard method using MLE, EM and BIC.
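The idea in the abstract can be illustrated with a minimal sketch for the simplest case: a univariate normal mixture with unequal variances. This is not the authors' implementation. It assumes a normal-inverse-gamma conjugate prior on each component's mean and variance, leaves the mixing proportions unregularized, and takes the joint posterior mode in the M-step. The function name `map_em_normal_mixture`, the hyperparameter names (`kappa`, `nu`, `mu_p`, `scale_p`), their default values, and the quantile-based initialization are all assumptions made for illustration. The modified BIC is taken here as 2 log-likelihood at the MAP minus (number of parameters) log n, so larger values are better.

```python
import numpy as np

def _log_joint(y, tau, mu, var):
    """log(tau_k * N(y_j | mu_k, var_k)) for every observation j and component k."""
    return (np.log(tau) - 0.5 * np.log(2.0 * np.pi * var)
            - 0.5 * (y[:, None] - mu) ** 2 / var)

def map_em_normal_mixture(y, G, n_iter=200, kappa=0.01, nu=3.0):
    """EM for a univariate G-component normal mixture with MAP (posterior-mode) M-step.

    Assumed prior per component (illustrative, not taken from the record above):
      mu | var ~ N(mu_p, var / kappa),  var ~ inverse-gamma(nu / 2, scale_p / 2).
    kappa is the prior's weight in units of observations, so kappa = 0.01
    corresponds to a small fraction of one observation.
    """
    y = np.asarray(y, dtype=float)
    n = y.size
    mu_p = y.mean()               # assumed prior mean: overall data mean
    scale_p = y.var() / G ** 2    # assumed prior scale: data variance shrunk by G^2

    # Deterministic start: means at evenly spaced quantiles, common variance.
    mu = np.quantile(y, (np.arange(G) + 0.5) / G)
    var = np.full(G, y.var())
    tau = np.full(G, 1.0 / G)

    for _ in range(n_iter):
        # E-step: responsibilities z[j, k], computed stably in log space.
        lj = _log_joint(y, tau, mu, var)
        z = np.exp(lj - np.logaddexp.reduce(lj, axis=1, keepdims=True))

        # Weighted sufficient statistics.
        nk = z.sum(axis=0)
        ybar = (z * y[:, None]).sum(axis=0) / np.maximum(nk, 1e-12)
        ss = (z * (y[:, None] - ybar) ** 2).sum(axis=0)

        # M-step: joint posterior mode of (mu_k, var_k); proportions unregularized.
        tau = nk / n
        mu = (nk * ybar + kappa * mu_p) / (kappa + nk)
        var = (scale_p
               + kappa * nk / (kappa + nk) * (ybar - mu_p) ** 2
               + ss) / (nu + nk + 3.0)

    # Modified BIC: the mixture log-likelihood evaluated at the MAP estimate.
    loglik = np.logaddexp.reduce(_log_joint(y, tau, mu, var), axis=1).sum()
    n_params = 3 * G - 1          # (G - 1) proportions + G means + G variances
    bic = 2.0 * loglik - n_params * np.log(n)
    return tau, mu, var, bic
```

The prior scale in the numerator of the variance update is what prevents a variance estimate from collapsing to zero. With tied observations such as `[0, 0, 0, 0, 0, 4.8, 4.9, 5.0, 5.1, 5.2]` and `G=2`, an unregularized M-step would drive one component's variance toward zero, whereas in this sketch both variances stay strictly positive and the BIC value stays finite.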
Pages: 155-181
Page count: 27