Model-based classification via mixtures of multivariate t-distributions

被引:76
作者
Andrews, Jeffrey L. [1 ]
McNicholas, Paul D. [1 ]
Subedi, Sanjeena [1 ]
机构
[1] Univ Guelph, Dept Math & Stat, Guelph, ON N1G 2W1, Canada
基金
加拿大创新基金会;
关键词
Classification; Food authenticity; Mixture models; Model-based classification; Multivariate t-distributions; HIGH-DIMENSIONAL DATA; MAXIMUM-LIKELIHOOD; ORIGIN;
D O I
10.1016/j.csda.2010.05.019
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
A novel model-based classification technique is introduced based on mixtures of multivariate t-distributions. A family of four mixture models is defined by constraining, or not, the covariance matrices and the degrees of freedom to be equal across mixture components. Parameters for each of the resulting four models are estimated using a multicycle expectation-conditional maximization algorithm, where convergence is determined using a criterion based on the Aitken acceleration. A straightforward, but very effective, technique for the initialization of the unknown component memberships is introduced and compared with a popular, more sophisticated, initialization procedure. This novel four-member family is applied to real and simulated data, where it gives good classification performance, even when compared with more established techniques. (C) 2010 Elsevier B.V. All rights reserved.
引用
收藏
页码:520 / 529
页数:10
相关论文
共 44 条
[1]  
Aitken A. C. XXV., 1927, Proc. R. Soc. Edinburgh, V46, P289, DOI DOI 10.1017/S0370164600022070
[2]   Extending mixtures of multivariate t-factor analyzers [J].
Andrews, Jeffrey L. ;
McNicholas, Paul D. .
STATISTICS AND COMPUTING, 2011, 21 (03) :361-373
[3]  
[Anonymous], 2000, Sankhya Ser. A, DOI DOI 10.2307/25051289
[4]  
[Anonymous], 2002, Algorithms for Minimization Without Derivatives
[5]   Assessing a mixture model for clustering with the integrated completed likelihood [J].
Biernacki, C ;
Celeux, G ;
Govaert, G .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2000, 22 (07) :719-725
[6]  
BINDER DA, 1978, BIOMETRIKA, V65, P31, DOI 10.2307/2335273
[7]   THE DISTRIBUTION OF THE LIKELIHOOD RATIO FOR MIXTURES OF DENSITIES FROM THE ONE-PARAMETER EXPONENTIAL FAMILY [J].
BOHNING, D ;
DIETZ, E ;
SCHAUB, R ;
SCHLATTMANN, P ;
LINDSAY, BG .
ANNALS OF THE INSTITUTE OF STATISTICAL MATHEMATICS, 1994, 46 (02) :373-388
[8]   High-dimensional data clustering [J].
Bouveyron, C. ;
Girard, S. ;
Schmid, C. .
COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2007, 52 (01) :502-519
[9]   GAUSSIAN PARSIMONIOUS CLUSTERING MODELS [J].
CELEUX, G ;
GOVAERT, G .
PATTERN RECOGNITION, 1995, 28 (05) :781-793
[10]  
Cook D, 2007, USE R, P1, DOI 10.1007/978-0-387-71762-3