Robust clustering by deterministic agglomeration EM of mixtures of multivariate t-distributions

被引:119
作者
Shoham, S [1 ]
机构
[1] Univ Utah, Dept Bioengn, Salt Lake City, UT 84112 USA
关键词
clustering; finite mixture models; EM algorithm; robust algorithms; t-distribution; deterministic annealing; agglomerative algorithms;
D O I
10.1016/S0031-3203(01)00080-2
中图分类号
TP18 [人工智能理论];
学科分类号
081104 [模式识别与智能系统]; 0812 [计算机科学与技术]; 0835 [软件工程]; 1405 [智能科学与技术];
摘要
This paper presents new robust clustering algorithms, which significantly improve upon the noise and initialization sensitivity of traditional mixture decomposition algorithms, and simplify the determination of the optimal number of clusters in the data set. The algorithms implement maximum likelihood mixture decomposition of multivariate t-distributions, a robust parametric extension of gaussian mixture decomposition. We achieve improved convergence capability relative to the expectation-maximization (EM) approach by deriving deterministic annealing EM (DAEM) algorithms for this mixture model and turning them into agglomerative algorithms (going through a monotonically decreasing number of components), an approach we term deterministic agglomeration EM (DAGEM). Two versions are derived, based on two variants of DAEM for mixture models. Simulation studies demonstrate the algorithms' performance for mixtures with isotropic and non-isotropic covariances in two and 10 dimensions with known or unknown levels of outlier contamination. (C) 2002 Pattern Recognition Society. Published by Elsevier Science Ltd. All rights reserved.
引用
收藏
页码:1127 / 1142
页数:16
相关论文
共 33 条
[1]
[Anonymous], 2000, WILEY SERIES PROBABI
[2]
[Anonymous], Pattern Recognition With Fuzzy Objective Function Algorithms
[3]
MODEL-BASED GAUSSIAN AND NON-GAUSSIAN CLUSTERING [J].
BANFIELD, JD ;
RAFTERY, AE .
BIOMETRICS, 1993, 49 (03) :803-821
[4]
Structure learning in conditional probability models via an entropic prior and parameter extinction [J].
Brand, M .
NEURAL COMPUTATION, 1999, 11 (05) :1155-1182
[5]
MIXTURE-MODELS AND ATYPICAL VALUES [J].
CAMPBELL, NA .
JOURNAL OF THE INTERNATIONAL ASSOCIATION FOR MATHEMATICAL GEOLOGY, 1984, 16 (05) :465-477
[6]
Robust clustering methods: A unified view [J].
Dave, RN ;
Krishnapuram, R .
IEEE TRANSACTIONS ON FUZZY SYSTEMS, 1997, 5 (02) :270-293
[7]
MAXIMUM LIKELIHOOD FROM INCOMPLETE DATA VIA EM ALGORITHM [J].
DEMPSTER, AP ;
LAIRD, NM ;
RUBIN, DB .
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-METHODOLOGICAL, 1977, 39 (01) :1-38
[8]
FIGUEIREDO M, 1999, ENERGY MINIMIZATION, P45
[9]
FIGUEIREDO M, 2000, INT C PATT REC ICPR