Regularized mixture discriminant analysis

Times cited: 11
Authors
Halbe, Zohar [1]
Aladjem, Mayer [1]
Affiliation
[1] Ben Gurion Univ Negev, Dept Elect & Comp Engn, IL-84105 Beer Sheva, Israel
Keywords
Gaussian mixture models; model selection; Bayesian information criterion; classification; regularized discriminant analysis
DOI
10.1016/j.patrec.2007.06.009
Chinese Library Classification
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
In this paper, we seek a Gaussian mixture model (GMM) of the class-conditional densities for plug-in Bayes classification. We propose a method for setting the number of components and the covariance matrices of the class-conditional GMMs. It is a compromise between the simplicity of model selection based on the Bayesian information criterion (BIC) and the high accuracy of model selection based on the cross-validation (CV) estimate of the correct classification rate. We apply an idea of Friedman [Friedman, J.H. 1989. Regularized discriminant analysis. J. Amer. Statist. Assoc., 84, 165-175] to shrink a predefined covariance matrix toward a parameterization with substantially fewer degrees of freedom (fewer adjustable parameters). Our method differs from Friedman's original method in the meaning of the shrinkage: we operate on matrices computed for a single class, whereas Friedman's method shrinks matrices from different classes. We compare our method with the conventional BIC-based and CV-based methods for setting the GMMs. The experimental results show that our method has the potential to produce parameterizations of the GMM covariance matrices that are better than those used in the other methods. We observed a significant increase in the correct classification rate of our method relative to the other methods, and the increase becomes more pronounced as the training sample size decreases. This implies that our method could be an attractive choice for applications with a small number of training observations. (C) 2007 Elsevier B.V. All rights reserved.
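The shrinkage idea in the abstract can be illustrated with a minimal sketch. The abstract does not give the paper's shrinkage formula or its target parameterization, so everything below is an assumption made for illustration: the function names `shrink_covariance` and `free_parameters`, the weight `lam`, and the choice of the matrix's own diagonal as the reduced parameterization. The sketch only shows how a single covariance matrix, computed within one class, can be moved between a full form and a form with fewer adjustable parameters.

```python
def shrink_covariance(cov, lam):
    """Blend a full covariance matrix with its own diagonal.

    Computes (1 - lam) * cov + lam * diag(cov) for a square matrix
    given as a list of rows. lam = 0 returns cov unchanged and
    lam = 1 returns the diagonal matrix. The diagonal target is an
    illustrative assumption, not the paper's stated choice.
    """
    if not 0.0 <= lam <= 1.0:
        raise ValueError("lam must lie in [0, 1]")
    d = len(cov)
    return [
        # Diagonal entries are kept; off-diagonal entries are scaled down.
        [cov[i][j] if i == j else (1.0 - lam) * cov[i][j] for j in range(d)]
        for i in range(d)
    ]


def free_parameters(d, diagonal):
    """Number of adjustable entries in a d x d covariance matrix."""
    return d if diagonal else d * (d + 1) // 2


# Hypothetical covariance of one mixture component of one class.
component_cov = [[2.0, 0.8], [0.8, 1.0]]
half_shrunk = shrink_covariance(component_cov, 0.5)
fully_shrunk = shrink_covariance(component_cov, 1.0)
```

In this sketch, a full covariance matrix in 10 dimensions has `free_parameters(10, False)` = 55 adjustable entries per component, while the diagonal form has 10, which is the kind of reduction in degrees of freedom the abstract refers to. How the shrinkage weight is chosen is not stated in the abstract and is left open here.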
Pages: 2104-2115
Number of pages: 12
Related papers
27 in total
[1]  
[Anonymous], 2000, WILEY SERIES PROBABI
[2]   MODEL-BASED GAUSSIAN AND NON-GAUSSIAN CLUSTERING [J].
BANFIELD, JD ;
RAFTERY, AE .
BIOMETRICS, 1993, 49 (03) :803-821
[3]   Regularized Gaussian discriminant analysis through eigenvalue decomposition [J].
Bensmail, H ;
Celeux, G .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 1996, 91 (436) :1743-1748
[4]   Assessing a mixture model for clustering with the integrated completed likelihood [J].
Biernacki, C ;
Celeux, G ;
Govaert, G .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2000, 22 (07) :719-725
[5]   Choosing models in model-based clustering and discriminant analysis [J].
Biernacki, C ;
Govaert, G .
JOURNAL OF STATISTICAL COMPUTATION AND SIMULATION, 1999, 64 (01) :49-71
[6]  
Bishop CM., 1995, Neural networks for pattern recognition
[7]  
Blake C.L., 1998, UCI repository of machine learning databases
[8]   GAUSSIAN PARSIMONIOUS CLUSTERING MODELS [J].
CELEUX, G ;
GOVAERT, G .
PATTERN RECOGNITION, 1995, 28 (05) :781-793
[9]   MAXIMUM LIKELIHOOD FROM INCOMPLETE DATA VIA EM ALGORITHM [J].
DEMPSTER, AP ;
LAIRD, NM ;
RUBIN, DB .
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-METHODOLOGICAL, 1977, 39 (01) :1-38
[10]  
Duda RO, 2006, PATTERN CLASSIFICATI